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Abstract. In this paper we study the logical aspects of branching automata, as defined 
by Lodaya and Weil. We first prove that the class of languages of finite N-free posets 
recognized by branching automata is closed under complementation. Then we define a logic, 
named P-MSO as it is a extension of monadic second-order logic with Presburger arithmetic, 
and show that it is precisely as expressive as branching automata. As a consequence of the 
effectiveness of the construction of one formalism from the other, the P-MSO theory of the 
class of all finite N-free posets is decidable. 


1. Introduction 

Sequential programs can naturally be modeled with Kleene automata, or equivalently with 
rational expressions, finite monoids, or monadic second-order (MSO) logic. The algebraic 
approach of automata provides an huge toolbox for the study of properties of programs, 
and has been widely used as a base for a lot of algorithms that manipulate logic formulas. 
The links between Kleene automata, rational expressions, finite monoids and MSO have 
important consequences in a lot of domains of computer science and mathematics, some 
of them are concretely applied as for example in program verification, others are more 
theoretical as for example in set theory. 

Introducing commutativity allows access to models of programs with permutation of 
instructions, or to concurrent programming. Among the formal tools for the study of 
commutativity in programs, let us mention for example Mazurkiewicz’s traces, integer vector 
automata or commutative monoids. In this paper we focus on the notion of branching 
automata introduced by Lodaya and Weil |LW98al ILW98bl ILWOOl ILWOlj . Branching 
automata are an extension of Kleene automata with particular transitions that naturally 
model parallelism. Traces of paths in branching automata are not (totally ordered) words as 
in Kleene automata, but partially ordered sets (posets) of letters, which are structured as 
traces of programs using the fork-join primitive for concurrency. Those particular posets, 
called N-free , are widely used in the study of concurrency. The fork-join primitive splits 
an execution flow / into fi, ■ ■ ■, f n concurrent execution flows and joins /i,..., f n before 
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it continues. Divide-and-conquer concurrent programming naturally uses this fork-join 
principle. Lodaya and Weil generalized several important results of the theory of Kleene 
automata to branching automata, for example, a notion of rational expression with the same 
expressivity as branching automata. They also investigated the question of the algebraic 
counterpart of branching automata: the sp-algebras are sets equipped with two different 
associative products, one of them being also commutative. Contrary to the theory of Kleene 
automata, branching automata do not coincide any more with finite sp-algebras. 

An interesting particular case is the bounded-width rational languages ILWOO] . where 
the cardinality of the antichains of the posets of languages are bounded by an integer n. They 
correspond to fork-join models of concurrent programs with n as the upper bound of the 
number of execution flows (n is the number of physical processors). Bounded-width rational 
languages have a natural characterization in rational expressions, branching automata, and 
sp-algebras. Taking into account those characterizations, the expressiveness of branching 
automata corresponds exactly to the finite sp-algebras. Furthermore, Kuske [ KusflO] proved 
that in this case, branching automata coincide also with monadic second-order logic, as it is 
the case for the rational languages of finite words. As in the general case monadic-second 
order logic is less expressive than branching automata, the question of an equivalent logic 
was left open. 

In this paper we investigate the question of defining a logic equivalent to branching 
automata in the general case. 

This paper contains two results regarding branching automata: 

• we prove that the class of languages defined by branching automata is closed under 
complementation; 

• we define a logic, named P-MSO as it is basically monadic second-order logic enriched 
with Presburger arithmetic, that is exactly as expressive as branching automata. 

The (effective) proof of the first result essentially relies on the closure under complemen¬ 
tation of rational languages of a finitely generated commutative monoid (Theorem 4.2 by 
Schiitzenberger and Eilenberg). The proof of the second result relies on effective construc¬ 
tions from one formalism to the other. As a consequence, the P-MSO theory of the class of 
finite N-free posets is decidable. 

The paper is organized as follows. Section[2]recalls basic definitions on posets. Section[3]is 
devoted to branching automata, rational expressions and sp-algebras. The complementation 
of rational languages is discussed in Section [4j Section [5] is devoted to the logical approach 
of branching automata. Finally, we present an alternative definition for branching automata 
in Section [6l 


2. Notation and basic definitions 

Let E be a set. We denote by V(E), V + (E) and A 4 >1 (E) respectively the set of subsets of 
E, the set of non-empty subsets of E and the set of multi-subsets of E with at least two 
elements. For any integer n, the set {1,..., n} is denoted [n] and the group of permutations 
of [n] by S n . The cardinality of E is denoted by \E\. We use the same notation for sets and 
multi-sets. We sometimes denote by e k the multiplicity k of an element e of a multi-set. 

A poset (P, <p) is composed of a set P equipped with a partial ordering <p. In this 
paper we consider only finite posets. For simplicity, by poset we always mean finite poset. 
A chain of length n in P is a sequence p\ <p ■ ■ ■ <p p n of elements of P. An antichain E 
in P is a set of elements of P mutually incomparable for <p. The width of P is the size of 
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a maximal antichain of P. An alphabet is a nonempty finite set whose elements are called 
letters. A poset (P, <p, p ) labeled by A is composed of a poset (P, <p) and a map p : P -A A 
which associates a letter A with any element of P. Observe that the posets of width 1 
labeled by A correspond precisely to the usual finite words: finite totally ordered sequences 
of letters. Throughout this paper, we use labeled posets as a generalization of words. In 
order to lighten the notation we write P for (P, <p,p) when no confusion is possible. The 
unique empty poset is denoted by e. 

Let (P,<p,pp) and (Q, <q, Pq) be two disjoint posets labeled respectively by the 
alphabets A and A'. The parallel product of P and Q , denoted P || Q , is the set P U Q 
equipped with the orderings on P and Q such that the elements of P and Q are incomparable, 
and labeled by Au A' by preservation of the labels from P and Q. It is defined as (PUQ, <, p) 
where x < y if and only if: 

• x, y G P and x <p y or 

• x,y G Q and x <q y 

and p{x) = pp(x ) if x E P, p(x) = Pq{x) if x E Q. 

The sequential product of P and Q, denoted by P • Q or PQ for simplicity, is the poset 
(PL)Q,<, p) labeled by A U A’, such that x < y if and only if one of the following conditions 
is true: 

• x E P, y E P and x <p y; 

• x E Q, y E Q and x <q y; 

• x E P and y E Q 

and p(x) = pp(x ) if x E P, p(x) = Pq{x) if x E Q. 

Observe that the parallel product is an associative and commutative operation on 
posets, whereas the sequential product does not commute (but is associative). The parallel 
and sequential products can be generalized to finite sequences of posets. Let ( Pi)i< n be a 

sequence of posets. We denote by f{ f< n -f* = Po.P n and ||j< n Pi = Pq || • • • || P n . 

The class of series-parallel posets, denoted SP, is defined as the smallest set containing 
the posets with zero and one element and closed under finite parallel and sequential product. 
It is well known that this class corresponds precisely to the class of N-free posets |Val 781 
IVTL 82 | . in which the exact ordering relation between any four elements xi,X2,x$, X4 cannot 
be x\ < X2, X3 < X2 and X3 < X4. The class of series-parallel posets labeled by an alphabet 
A is denoted SP(A). We write SP + for SP — {e} and SP + (A) for SP + (A ) — {e}. 

A poset P has a sequential (resp. parallel) factorization if P = P\-P2 (resp. P = P\ || P2) 

for some nonempty posets Pi and P2. A sequential factorization P = P\ . P n is maximal 

if each Pi, i E [n], has no sequential factorization. The definition of the notion of maximal 
parallel factorization is similar. Posets having a parallel factorization are called parallel 
posets. The sequential posets are those of cardinality 1 and those having a sequential 
factorization. 


3. Rational languages, automata and recognizability 

A language of a set A is a subset of X. Let A be an alphabet. The sequential and 
parallel product of labeled posets can naturally be extended to languages of SP(A). If 
L\, L2 C SP(A), then L\ • L2 = {Pi • P2 : Pi G L\, P2 E L2} and L\ || L2 = {Pi || P2 : Pi G 
L\, P2 G L2}. 
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3.1. Rational languages. Let A and B be two alphabets and let P E SP(A), L C SP(B) 
and ( £ i. We define the language Lo^PC SP(A U B ) by substituting non-uniformly 
in P each element labeled by £ by a labeled poset of L. This substitution Lo £ is the 
homomorphism from (SP(A), ||, •) into the power-set algebra (V(SP(A U B)), ||, •) with 
a i->- {o} for all a E A, a A £, and £ i —> L. It can be easily extended from labeled posets to 
languages of posets. Using this, we define the substitution and the iterated substitution on 
languages. By the way the usual Kleene rational operations |Kle56| are recalled. Let L and 
L' be languages of SP(A ): 

L ot L' = U L oc P 

5 PeL 1 5 

L*^ = U L* with L°Z = {£} and L (i+1)? = ( U L*) o £ L 
ieN j<i 

L* = {fl Pi : n E N, Pi E L} L + = { J] Pi : 0 < n E N, Pi E L} 

i<n i<n 


A language L C SP + (A) is rational if it is empty, or obtained from the letters of the 
alphabet A using usual rational operators : finite union U, finite concatenation •, and 
finite iteration + , and using also the finite parallel product ||, substitution o £ and iterated 
substitution *£, provided that in L*^ any element labeled by £ in a labeled poset P E L is 
incomparable with another element of P. This latter condition excludes from the rational 
languages those of the form (a£6)*^ = {a n £6 n : n E N}, for example, which are known to be 
not Kleene rational. Observe also that the usual Kleene rational languages are a particular 
case of the rational languages defined above, in which the operators ||, o £ and are not 
allowed. 

Example 3.1. Let A = {o, 6, c} and L = co £ (a || (bf))*^. Then L is the smallest language 
containing c and such that if x E L, then a || (bx ) E L: 

L = {c,a || (be), a || ( b(a || (be ))),... } 

Let L be a language where the letter £ is not used. In order to lighten the notation we 
use the following abbreviation: 

L® = {e} o £ (L || £)*« = {|| i<n P % : n E N, P E L} L® = L® - {e} 

L* and L + are the sequential iterations of L whereas L® and L ® are its parallel iterations. 

3.2. Branching automata. Branching automata are a generalization of usual Kleene 
automata. They were introduced by Lodaya and Weil [LW98al LW98b , ILWOO] , 

A branching automaton (or just automaton for short) is a tuple A = (Q, A, E, I, F) 
where Q is a finite set of states, A is an alphabet, I C Q is the set of initial states, F C Q 
the set of final states, and E is the set of transitions of A. The set of transitions of E is 
partitioned into E = (E se(i , E{ or k, E- ]0 \ n ), according to the different kinds of transitions: 

• E se q C (Q x A x Q) contains the sequential transitions, which are usual transitions of 
Kleene automata; 

• Efork CQx M yl (Q) and Ej 0 - m C M. >l (Q) x Q are respectively the sets of fork and join 
transitions. 
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Sequential transitions (p, a, q) G Q x A x Q are sometimes denoted by p A q. The arity of a 
fork (resp. join) transition (q,R) G Q x Af >1 (Q) (resp. (P, g) G 7 W >1 (( 5 ) x Q) is |P|. 

We now turn to the definition of paths in automata. The definition we use in this paper 
is different, but equivalent to, the one of Lodaya and Weil |LW 98 al ILW 98 bl ILWOOl ILWOlj . 
Paths in automata are posets labeled by transitions. A path 7 from a state p to a state q is 
either the empty poset (in this case p = q), or a non-empty poset labeled by transitions, 
with a unique minimum and a unique maximum element. The minimum element of 7 is 
mapped either to a sequential transition of the form (p, a, r) for some a G A and r G Q or to 
a fork transition of the form ( p,R ) for some R G Ad >1 (Q). Symmetrically, the maximum 
element of 7 is mapped either to a sequential transition of the form (r 7 , a, q) for some a G A 
and r' G Q or to a join transition of the form (R',q) for some R' G M >l (Q). The states p 
and q are respectively called source (or origin) and destination of 7. Two paths 7 and y 7 
are consecutive if the destination of 7 is also the source of 7'. Formally, the paths 7 labeled 
by P G SP + (A) in A are defined by induction on the structure of P: 

• for any transition t = (p, a, q ), then t is a path from p to q, labeled by a; 

• for any finite set of paths {7o,---,7fc} (k > 1 ) respectively labeled by Pq,, P/. G 
SP + (A), from po,. ■ ■ ,Pk to qo, ■ ■ ■, qk , if t = (p, {po,... ,Pk}) is a fork transition and 
t' = ({go, • • ■ ,qk},q) a j°i n transition, then 7 = t(\\j<k 7 j)t' is a path from p to q and 
labeled by \\j<k Pj] 

• for any non-empty finite sequence 70, ■ ■ ■, 7fc of consecutive paths respectively labeled by 
Po,..., Pfc, then n,-<*+i 7 j is a path labeled by ri;<fe+i Pj from the source of 70 to the 
destination of 7 

Observe that non-empty paths are labeled posets of two different forms: t or tPtf for some 
transitions t, t' and some labeled poset P. In an automaton A. the existence of a path 7 
from p to q labeled by P G SP(A) is denoted by 7 : p= 4 > q. A state s is a sink if s is the 
destination of any path originating in s. 

A labeled poset is accepted by an automaton if it is the nonempty label of a path, called 
successful , leading from an initial state to a final state. The language L(A) is the set of 
labeled posets accepted by the automaton A. A language L is regular if there exists an 
automaton A such that L = L(A). 

Example 3 . 2 . Figure [l] represents an automaton on the alphabet A = {a, 6} that accepts 
P G SP + (A ) iff P contains at least one a. It has one initial state 1 and one final state 2 , two 
sequential transitions labeled by a from 1 to 2 and from 2 to 2, two sequential transitions 
labeled by b from 1 to 1 and from 2 to 2, two fork transitions (1, {1,1}) and (2, {2, 2}), and 
three join transitions ({1,1},!), ({2, 2}, 2) and ({1,2}, 2). 



Figure 1 : An automaton that accepts P G SP + (A ) iff P contains at least one a. 


Theorem 3.3 (Lodaya and Weil I .W 9 Saj j. Let A be an alphabet, and L C SP + (A). Then 
L is regular if and only if it is rational. 
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Example 3.4. On its left side, Figure [2] represents the automaton 

-4 = ({!) 2,3,4,5, 6}, {a, 6}, E, {1}, {6}) 

where the set of sequential transitions is E seq = {(2, a, 4), (3,6,5)}, the set of fork tran¬ 
sitions is Ef or k = {(1, (1,1}), (1, (2, 3})} and finally the set of join transitions is Ej 0 j n = 
{({6, 6}, 6), ({4, 5}, 6)}. On the right side of the Figure is pictured an accepting path labeled 
by a || 6 || a || 6. Actually, L(A) = (a || 6)®. 



Figure 2: An automaton A with L(A) = (a || 6)® and an accepting path labeled by 
a || 6 || a || 6. 


Example 3.5. Let A = {a, 6, c} and L = c (o || (6£))*^ be the language of Examples 
and |4.7| Figure [3] represents an automaton A such that L(A) = L. 


3.1 



Figure 3: An automaton A such that L(A) = co^ (a || (6£))*^. 


Proposition 3.6 (Lodaya and Weil |LW00| ). Let A be an alphabet. The class of regular 
languages of SP + (A ) is closed under finite union. Furthermore, if B is an alphabet, 
<f> : SP + (A ) —> SP + (B ) a morphism of free sp-algebras, and L a regular language of 
SP + (A), then 4>(L) is a regular language of SP + (B). 

Proof. The closure under finite union is a direct consequence of the generalization of the 
notion of cartesian product of automata to branching automata. The closure under direct 
image by cf is also an easy generalization of the construction for Kleene rational languages. □ 

Proposition 3.7 (Lodaya and Weil |LW01| ). Let p and q be two states of a branching 
automaton A. It is decidable, in polynomial time, if there is a path from p to q in A. 

An automaton is sequentially separated if, for all pairs (p, q) of states, all labels of paths 
from p to q are parallel posets, or all labels of paths from p to q are sequential posets. 

The following proposition will be used later in the paper. 

Proposition 3.8. For every automaton A there is a sequentially separated automaton B 
such that L(A) = L(B). 

Proof. Let A = ( Q , A, E, I, F). The states of B are the elements of Q x B. In B, there is 

• a sequential transition ((p, 6), a, (q, true)) iff ( p,a,q ) € E, 

• a fork transition ((p 0 , b 0 ),{(pi, &i),..., (p n , b n )}) iff {po, {Pi, ■ ■ ■,Pn }) ® E, 
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• a join transition ({(pi,&i),..., {p n ,b n )}, (po, false)) iff ({pi,... ,p n },Po ) e E. 

The initial (resp. final states) of B are those of the form (p,b) with p £ I (resp. i £ F). 
Clearly L(A) = L(B) and B is sequentially separated. Q 


3.3. Rationality and semi-linearity. A subset L of A® is linear if it has the form 

L — Qt || ■ ■ ■ || Rfc || || ■ ■ ■ || 

where the a* and ajj are elements of A and I is a finite set. It is semi-linear if it is a finite 
union of linear sets. 

The class of ||-rational languages of A® is the smallest containing the empty set, e, 
and closed under finite union, parallel product ||, and finite parallel iteration ®. We refer 
to |ES69| for a proof of the following result: 

Theorem 3.9. Let A be an alphabet and L C A®. Then L is || -rational if and only if it is 
semi-linear. Furthermore, the construction from one formalism to the other is effective. 

Proposition 3.10. Let A be an alphabet and L be a rational language of SP + (A). Then 
L C A® if and only if L is || -rational. 


Proof. The inclusion from right to left follows immediately from the definition. Let us turn 
to the inclusion from left to right. There exists an automaton A = ( Q , A, E, I, F ) such that 
L = L(A). For each pair (p,q) of states of A define L P)q as the set of posets labeling paths 
from p to q. As P ■ P' fL L for all P, P' £ SP + (A) we can assume that P ■ P' 0 L Ptq for all 
P, P’ £ SP + {A) and p,q e Q. Then 

Ep,q = U II -kp*,g<r(i) U ( a l 

(p,{pi:--,Pn})eE [olii (p,a,q)eE se q 

({91 , -,<2n},g)€£join 
cr€.Sn 


The set of all such equalities forms a finite system of equations, which can immediately 
be re-written as a context-free grammar G where the usual concatenation is replaced by ||, 
which commutes. By Parikh’s Theorem (see [Parfifij . Theorem 2), each L Pyq is a semi-linear 
set of A® with e 0 L Pi9 , which can be effectively be computed from G. As L = U (ij^ixpLij, 
and the class of semi-linear sets is closed under finite union, then L is also semi-linear, hence 
||-rational by Theorem 3.9 □ 


The definitions of linearity, semi-linearity, rationality and ||-rationality, which are given 
above over free algebras, can naturally be generalized over (non-free) algebras. 


4. Complementation of rational languages 

The first result of this paper is stated by the following Theorem which claims, together with 
Proposition 3.6 that the class of rational languages of N-free posets is closed under boolean 
operations. 

Theorem 4.1. Let A be an alphabet. The class of rational languages ofSP + (A ) is effectively 
closed under complement. 

The proof of Theorem |4.1| relies on the closure under complementation of the class of 


-rational languages of commutative monoids (Theorem 4.2). 
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Theorem 4.2 (Eilenberg and Schiitzenberger, Theorem III of ES69 | ). If X and Y are 
||-rational subsets of a commutative monoid M, then X n Y and Y — X are also || -rational 
subsets of M. 


As emphasized in [Sak03] . if M is finitely generated then Theorem 4.2 is effective. 
Theorem 4.2 was first proved by Ginsburg and Spanier [GS64j in the case of finitely generated 


free commutative monoids. The following proposition is a corollary of Theorem 4.2 


Proposition 4.3 (Eilenberg and Schiitzenberger, Corollary III.2 of [ES69] ). If tp : M' —> M 
is a morphism of commutative monoids, M' is finitely generated and X is a \\-rational subset 
of M, then tp~ l (X ) is a || -rational subset of M'. 


Before going into the details we need to introduce the necessary notions on algebras 
for languages of N-free posets. For the basic notions on algebra we refer to [Alm94] . An 
sp-algebra (S',-, ||) consists of a set S equipped with two operations • and ||, respectively 
called sequential and parallel product , such that ( S , •) is a semigroup (• is associative) and 
(S, ||) is a commutative semigroup. Observe that the notion of an sp-algebra equipped with 
a neutral element 1 (verifying 1 • x = x ■ 1 = x || 1 = x for any element x of the sp-algebra) 
corresponds to bimonoid in (BE96 . For each alphabet A there exists a free sp-algebra 


which is isomorphic to SP(A) (and which is also denoted by SP(A)). For simplicity we 
often denote an sp-algebra (S, ■, ||) by S. A morphism tp : S T between two sp-algebras 
recognizes X C S if X = p~ 1 (R) for some R C T. Sometimes the reference to ip is omitted 
and we say that T recognizes X. The following propositions are easy generalizations of 
well-known results on semigroups (see |Pin84l Prop. 1.8 and 1.9] for the semigroup versions). 


Proposition 4.4. If <p : A —»• S is a map from an alphabet A to an sp-algebra S, there exists 
a unique morphism tp : SP + (A ) —> S such that p(a) = <p(a) for all a G A. Furthermore, Tp 
is surjective iff <p{A) is a generator of S. 

Proposition 4.5. Let A be an alphabet, (p : SP + (A ) —>• S and ip : T —>• S two morphisms of 
sp-algebras, with ip surjective. There exists a morphism p : SP + {A ) —> T such that p = ipp. 
Furthermore, p recognizes any L C SP + (A) recognized by tp. 


A subset X of an sp-algebra S is recognizable if there exists a finite sp-algebra T and 
a morphism tp : S —>• T such that tp recognizes X. A congruence ~ of sp-algebras is an 
equivalence relation compatible with the operations, ie. x ~ y implies that (1) : u-x-v ~ u-y-v 
and (2) : u || x || v ~ u || y || v for all u,v. Actually, as the parallel product commutes in 
sp-algebras, the condition (2) is equivalent to x || u ~ y || u for all u. An equivalence relation 
has finite index if it has a finite number of equivalence classes. It is well-known that the map 
tp ^ : S —>• jSy~ which associates to any element of S its equivalence class in the quotient 
sp-algebra 57~ can be extended in a unique way into a morphism of sp-algebras. Let X 
be a set whose elements are called variables and S be an sp-algebra. A term on S is a full 
binary tree whose leafs are either variables or elements of S, and nodes are a sequential or a 
parallel product. Formally, the set T of terms on S is defined inductively by X C T, each 
element of S is in T and, for all t,t! E T, t ■ t' E T and t || t' G T. Observe that a N-free 
poset labeled by A can be thought of as a term of A (which may not be unique) without 
variables, and reciprocally (note that a term corresponds to a unique N-free poset). A value 
can be associated to any term t whose leaves are all elements of S by the partial function 
e : T —> S inductively defined by e(s) = s for all s E S, eft || t') = t \\ t' and eft ■ t') = t • t'. 
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Let L C S. The syntactic congruence of L on S' is defined by: for all x, y G S, x y if, 
for any term t(x o,..., x n ) on S and any si,..., s n G S, 

e(t(x, si,..., s„)) G L <==> e(t(y , si,..., s n )) G L 

It is well-known that the quotient sp-algebra S/^l recognizes L. Furthermore, the following 
property holds on S/~l- 

Proposition 4.6 (see |Alm94j or [LW00| ). Let S and T be two sp-algebras, L C S and 
ip : S —» T be an onto morphism. Then L = p~ l p(L) if and only if, for any x,y G S, 
p(x) = p(y) implies i/. 

Example 4.7. Let A = {a, b, c } and L = c (a || (bf ))*^ be the language of Example 
Let (S, •, ||) be the sp-algebra defined by S = {a, b, c, s, 0,1}, be = s, a || s = c, 1 is the neutral 
element for both sequential and parallel products, and all other products are mapped to 0. 
Then S recognizes L. Indeed, let p : SP + (A) -» S be the morphism defined by ip(a) = a, 
<p(b) = b and <p(c) = c. Then <p(L) = {c} and L = y? _1 (c). Furthermore S = SP + (A)/~ L . 

Lodaya and Weil have proved the following connection between recognizable and rational 
languages: 

Theorem 4.8 (Lodaya and Weil, Theorem 1 of |LW98aj ). Recognizable languages are 
rational. 


3.1 


However, the following example (from |LW98aj l shows that in general, rational languages 
are not recognizable. 

Example 4.9. Let A = { a , b} and L = (a || b )®. Let ip : SP + (A ) aZU {_L} the morphism 
defined by tp(a) = 1, ip(b) = — 1, xy = T for all x, y G Z U {T}. Then L = <^ _1 (0), and 
thus p recognizes L. Furthermore, SP + {A)/^ L is isomorphic to Z U {T}, thus L is not 
recognizable as a consequence of Proposition |4.6[ Example |3.4| gives an automaton A with 
L(A) = L. 


Let us return to the proof of Theorem 4.1 The first step is the construction of an 


algebra from an automaton. We need to introduce some new definitions, which are applied 
in Example |4.29| at the end of this section. 

Let L C SP + (A) and A = (Q, A, E, /, F) be an automaton such that L(A) = L. 

For every pair (p, q ) of states, define K p _ q to be the set of multi-sets of pair of states as 
follows: 


Kp, q = {{{pi, <? CT (1)), ■ ■ ■ , (Pn, q<T(n))} ■ (P, {Pi, ■ ■ ■ ,Pn}) G ^fork, 


({?i, • • •, q n },q) G £j oin , a G S n and p t ==P q a (i),Pi G SP + (A), for all i G [n]} 

Define also J- p q to be the smallest set of multi-sets of pairs of states as follows. Let 

p 


x’O ._ 

J ~v->q 


{{(p.,)}} if thereexist* p^ q ,P e SP+(A), 
otherwise. 


and 

F: 


' 2+1 _ 

P,Q 


= F va U {M - ( Pj , qj ) U X:M= {( Pl , qi ) k \ ..., 0 Pn , q n ) kn } G F p>q , 


j G [n],kj >0,lG K {r> :r <i : i)} 
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Now let Fp^ q = Ui^Fj ] q . Observe that 0 fL If P = Pi || ■ ■ ■ || P n is a N-free poset 

p 

such that there exists a path 7 : p =7> q which is the parallel composition of n > 0 paths 

A 

p. 

7 i : pi ==> qi, for some Pi,qi E Q, Pi E SP + (A ), i E [n], observe that by construction the 
multi-set {(pi, qi(p n , q n )} belongs to F p , q . Reciprocally, if {(pi, qi(p n , Qn)} G 

p. 

F Pt q, there exist paths pi ==> qi for some Pi G SP + (A ) for all i G [n], that can be composed 
to build a path p Pl =| P,! q. 

Let E, I be sets, X = {Xi : i G 1} and Y be respectively a set of multi-sets of elements 
of E and a multi-set of elements of E. Set 

{Xi : i G I}\Y = {Xi - Y : Y C X h i G /} 

When P G SP + (A), define 


p. 

P{P) = {{(pi, qi), • • ■, (Pn, Qn) ■ Pi ==> Qi for all i G [n]} : 

P = Pi || • • • || P„, Pi G SP + (A) for all i G [n]} 
Thus 7L(P) is the set of all finite multi-sets {(pi, <?i),..., (p n , q n )} over Q 2 such that P = 
Pi || ■ ■ ■ || P n and, for all i G [n], Pi G S'P + (R) and p* ==> q^. 

Let ~_4 be the relation defined on SP + (A) by P ~_4 P' if and only if, for all p,q G Q, 

U u (4.1) 

xen(P) xeTi(P') 

Equivalently, P ~_4 P 7 if and only if, for all multi-sets M over Q 2 , there is some x G 7L(P) 
such that x U M G Pp,q if and only if there is some x 7 G P(P 7 ) such that x 7 U M G P Pi(? . 
Obviously, ~_4 is an equivalence relation. Also, for all p,q E Q, P E SP + (A), we have 

p 

0 G Uxe 7 £(P) Fp,q\ x if an< I only if P=^Q- Immediately, 

p p> 

Lemma 4.10. If P P 7 then p =7 (/ iff p q for all p,q E Q. 

A A 

The following lemma shows in particular that SP + (A )/~_4 is equipped with a structure 
of sp-algebra. 


Lemma 4.11. ~_4 is a congruence of sp-algebra. 

Proof. First we prove that if P ~_4 P 7 then LPR ~_4 LP'R for all L,R E SP(A). Let 
r E U xe'R.(LPR) ^ 7 p,q \ x • If L = R = e the conclusion is trivially reached. Otherwise, LPR is 
a sequential poset of SP + (A). Assume r = {( 77 , si),..., (r*., s^)}. By definition of r, there 
exists a path 7 : p 7 =7 q' such that r U {(p 7 , g 7 )} G F v ^ q . By dehnition of F Pi q there exists a 


path 7 i : ri =7 Si for some Si E SP + (A ) and for all i E [k], and the paths (7$ : r ? ; =>• s*)ie[fc] 

and 7 : p 7 ^T?g 7 can be used to compose a path p p |=l| The path 7 : p 7 ^7? g 7 

A A A 

1 L P R 

can be decomposed into 7 : p 7 =7 t\ ==> t 2 =7- q' for some ti,t 2 G Q. As P ~_4 P 7 by 

AAA 


Lemma 


4.10 


p / p p/ p 

we also have t\ =7 0, thus there exists 7 ' : p 7 =7- 1\ =7- 1 2 =7 g 7 , which can be 
A AAA’ 
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used in parallel with the paths ( 7 * : r\ ==>■ to build a path p R =l ^' k q. Thus, 

r e ^Jxen(LP'R) Fp,q\ x ■ 

We now show that if P ~_4 P’ then P || P" ~_4 P' || P" for all P" E SP(A). The 
case P" = e is a triviality, so we assume that P" 7^ e. Let r = {(ri, si),..., (r*,, s*.)} E 
Ua: 67 e(p||p") X p , q \x. There exist a decomposition P || P" = X\ || ■■■ || X n of P || P ", 

(Pi)9i)»e[n]» Si E SP+{A) for all i E [A;], paths (p t =A%)ie[n] and ( r * s »)ie[fc], that 


is not a max- 


can be composed to form a path p‘ Yl ^ \\.S k ^ ^ ^ 11 

A 

X- 

imal parallel factorization of P || P" then the paths {pi => qi)i£[ n ] can be replaced by 
X' 

(p'i=> q'i)i£[n'] where X[ || ••• || X' n , is a maximal parallel factorization of P || P", such 

x' s- 

that the paths (p i =A Qi)ie{n'] and the paths (r,; =>Sj)j g [ fc ] can be composed to form a path 


A 

*il|-|IW 1 IM Sfc Q ■ 

p => q. bmce a, 

A 1 


X’ , is a maximal parallel factorization of P 


there exists a partition (/, J) of \n'\ such that P =||j 6 j X[ and P" =||jej X'-. As P ~ A P' 


we have 


{(Pj,Qj) : j G «/} U {(ri,Si) : i E [As]} E (P p ,<A\{(Pd q'f) ■ i E /}) D (P M V) 


for some x E 77(P'), so r E (P M \\x)\\{(p'-, g') : J E ./}, ie. r E P p , q \y with y = xU{(p'-, q'j) : 
j E J}. So ~^4 is a congruence of sp-algebra. □ 

Let ip~ A : SP + (A ) -» SP + (A)/~ v 4 the morphism which associates to each poset P E 
SP + (A ) its equivalence class in SP+{A)/~ a . Then ip~ A recognized L, since L(A) = (p *(A) 
where 

X = {(p~ A (P) : 0 E [J Pij\x for some (i, /) E I x P} 
xelZ{P) 

Observe that X may be infinite. 


Example 4.12. Let A be the automaton of Example |3.5| Then SP + (A )/~_4 is isomorphic 
to SP + {A)/^ L ^ 4 ) (see Example 4.7 for S P + (A) / ^ L ^). 

Observe that ~_4 may have an infinite index (take, for example, any automaton of 
language (a || 6 )® - see Example |3.4[ ). 

Lemma 4.13. The number of equivalence classes for ~_4 containing a sequential poset (of 
SP + (A)) is finite. 


Proof. By contradiction, assume that there exists an infinite sequence (Pi)jgN of sequential 
posets such that for all i,j E N, if i 7 ^ j then P,; rf A Pj. To each equivalence class 

[P'i]~A we associate the set Kp f = {( p,q ) E Q 2 : p=j>q}. Clearly there exist a finite 
number of such sets, so there exist i,j, with i 7 ^ j such that Kp i = Kp } - For all p,q E Q, 
S E SP + (A), let Xpq (P) = UxeKfS) -Xp,q\ x - As P: Pj there exist p,q E Q and 

wlog. r = {(ri, si),... , (r n , s n )} E X Ptq (Pi) - X p , q (Pj). As r E X p , q (Pi) there exist paths 

7 i : ri =>■ Si for some Si E SP + (A) and for all i E [n], and 7 : p' ==X q', such that the paths 
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(7i)*e[n] and 7 can be used to compose a path 5 : p ^ =>^ " q. In <5, 7 can be replaced by 

7 ' : p' =4- q' to form a path p J ^ ^ " q. Thus r E X v a (Pj), which is a contradiction. □ 

A A 


Let X C SP + (A). We denote the set of sequential posets of X by 

Seq(X) = {P E X : P E A or 3Pi, P 2 G 5P + (A) such that P = PiP 2 } 

Denote also by L Pi9 the set of non-empty labels of paths from state p to state q in A. We 
are now going to prove that ip~ A (L(A)) is a ||-rational language of (ip^ A (Seq(SP + (A))))®. 
If has a finite index this is a triviality, so assume that it has an infinite number of 
equivalence classes, and recall that ip~ A (Seq(L(A))) is finite by Lemma [4. 13 
We have 


L 


PA ~ 


IJ || Seq(L r)S ) 

XeT p , q ( r - s ) eX 


and 


(Lp^q) 


U II ip^ A (Seq(Lr jS )) 
xeT p , q ( r ’ s )£X 


So, it suffices to show that P p . q is a ||-rational set of ( Q x Q)® in order to prove that 
ip^ A (L Pt q) is a 11-rational set of elements of p~ A (Seq(SP + (A))), and thus so is ip~ A (L(A)) = 

U (iJ)£lxF<P~ A (Li,f). 

Lemma 4.14. J~p. q is a ||-rafionaZ set of (Q x Q)®. 


Proof. First observe that 0 fL J~p,q- Build an automaton B whose alphabet is Q x Q as follows. 
Take two copies Q\ and Q 2 of the states of A. For each fork transition (r, { 77 ,..., r n }) of A, 
add the same fork transition in Q\. For each join transition ({si,..., s n }, s) of A, add the 
same join transition in Q 2 . For each pair of states (r, s) such that there is a non-empty path 
from r to s in A, add a sequential transition from r in Q\ to s in Q 2 , labeled by (r, s). The 
initial state is p in Q\ and the final state is q in Q 2 . The language of B is P p>q . There is no 
path in B with a sequential transition or a join transition followed by a sequential transition 
or by a fork transition. As a consequence of Theorem 3.3 and Proposition 3.10, L{B) is 
||-rational. □ 


Observe that the construction given in the proof of Lemma 4.14 is effective as a 


consequence of Proposition 3.7 


Define the equivalence relation ^ over the elements of SP + (A) by P P’ 


2 . „ P _ f „\ ^ /^2 . „ L ^ td p/ .g, 


p =7> q} , or equivalently P 
e U*erc(P) = {(PAl) e Q 2 : 0 G UxgW(P') Note that 


iff {(p,q) E Q 2 : p=>q} = {( p,q ) E Q 

{( p,q ) e Q 2 


^ has finite index, since Q 2 is finite. When D E V(Q 2 ), denote by ( ' A> = {P E 


SP+(A):p^qi& (p,q) E D}. 


Lemma 4.15. Let <p : SP + (A) -A S be a morphism of sp-algebras. The following conditions 
are equivalent: 

(1) P ~ 0 P+(A) P’ for all P, P’ E p-\s), s E S; 

(2) ip recognizes L Ptq for all ( p,q ) E Q 2 . 

Proof. If (1) is true then ip recognizes L pA for all (p,q) E Q 2 since 

l p ,= u Ewir’D 

D£V(Q 2 ) 

(p,q)£D 
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Conversely assume (2) and, by contradiction, that (1) is false, ie. there exist s G S, 
P,P' G (/? _ 1 (s) such that P G A^ P -' 4 \ P 1 G ^ with D 7 ^ ID'. Then 99 can not 

recognize L Ptq with (p, q) £ D and (p. q ) 0 D’ , or the converse. □ 

Let </? : PP + (A) -> S be a morphism of sp-algebras such that P ~ 0 P ^ P’ for all 
P,P' G y? _ 1 (s), s & S. We define the equivalence relation over the elements of S by 

s ~0 s' iff there exist P G </?(s), P' G <£>(s') such that P ~g P ^‘ 4) P'. We have P ~g P ^ P' 
iff ¥>(P) rs_/ 0 p(P'). If ip is surjective then ~jj has finite index, and each equivalence class of 
~0 can be denoted /S. s D = {<p(P) G S : P G A^ P for some P G P(Q 2 ). Furthermore, 
<^(A PP+(j4) ) = Afj and ^ _ 1 (Af,) = A PP+(j4) . 

Lemma 4.16. Let 99 : SP + (A) -A S be a surjective morphism of sp-algebras such that 

• ip recognizes L pi9 for all ( p,q ) G Q 2 . 

• ip(L Ptq ) is a ||-ratzona/ of S for all ( p,q ) G Q 2 . 

Each equivalence class A|, of is a || -rational set of S. 


- 1 (A| t ) = A sp+</1 > - 


D — A (p.q)£D-hp,q ^(p,g)^D ^p,q'i as T{-^p,q) is ||-rational 
for all (p, q) G Q 2 


and as by Theorem 4.2 the class of ||-rational 


Proof. We have </? 
and </9 recognizes L Pi? 

sets is closed under finite boolean operations, A^ = T>{T\(p,q)eDLp, q — U(p,<?)£D-^p,g) = 
n (p,q)&D<p(Lp,g) - u (p,q)^ D p(Lp, q ) is ||-rational. □ 


It is clear that the morphism of sp-algebras p^ A : SP + (A) — > SP + (A )/~_4 verifies the 
conditions of Lemma 14.161 

When S is an sp-algebra, define also the equivalence relation ~f, q on the elements of S 
by S rsJ £ q s' iff xs = xs' and sx = s'x for all x G S. The relation between ~g P b 4 )/^ and 
~Sq ^ a ^~ a is given by the following lemma: 


Lemma 4.17. If s 


SP+(A)/~ a , 

0 S 


then 


s ~ 


SP+(A)/~ a , 


seq 


Proof. Let P G </?^(s) and P' G <p^f A {s'). We have P ~0 P ^ P', then p ==> q iff P==^ Q f° r 

all p,q G Q, thus p => q iff p => q, for all T G SP + (A), p,q G Q. It follows that 1Z(TP) = 

H(TP'), and thus TP ~ A TP' , ie. tp^ A (TP) = tp^ A (T)ip^ A (P) = <p„ A (T)tp„ A (P') = 
ip^ A (TP') for all T G SP + (A). As <p~ A : 5P + (A) -A 5P + (A)/~^ is surjective, it follows 
that x<p~ A (P) = Xip^ A {P') (and, using symmetrical arguments, ip^ A (P)x = p^ A {P')x) for 

□ 

is a II -rational 


all x G 5, thus <p~ A (P) ~feq ^ <p~ A (P'). 

Lemma 4.18. ~f e P ( a ^~ a has finite index, and each of its equivalence classes i 


as a 


set of SP + (A)/~ A . 

Proof. As ~^ P ^ has finite index, then so is ~g P bL)/~.A, anc [ ^hus ^f eq 

Each equivalence class of ~f> P <yA ^~ A is a finite union of 

union, 
rational. M 


4.17 


consequence of Lemma 

equivalence classes of ~^ P+ ^^~' 4 . As the class of ||-rational sets is closed under hnite 
it follows from Lemma 


4.16 


that each equivalence class of ~f> P b 4 )/~A is 







14 


N. BEDON 


We have SP + (A) — L = IJ DeV (Q 2 ) ip 

DnixF=<6 


-1 

A 


IAT (av ' 


Since the class of rational sets of 


SP + (A ) is closed under finite union, it suffices to show that <pZ A (A'^ is a rational 

set of SP + (A ) for each D E V(Q 2 ) in order to show that SP + (A) — L is a rational set of 
SP + (A). This will be achieved by Lemma 4.28 below. 

We need to introduce the notion of ||-quotient of a language. Let L,L' C SP(A). The 
||-quotient of L' by L is 

L'\L = {P E SP(A) : 3P 1 E L such that P || P' E L'} 


Lemma 4.19. Let A be an alphabet, and X, Y be two ||-rational languages of A®. Then 
Y\X is || -rational. 


Proof. By Theorem 3.9, X and Y are semi-linear. By Theorem |5.2[ X and Y are also 
Presburger sets of some formulae ipx(x\ ,..., x n ) and ipy(yi, ■ ■ ■, y n )- Up to a change in 
variables names we can assume that the free variables of both formulae are disjoint. Then 
Y\X is the Presburger set of 

<Py\x(zi, ■■■,z n )= ...,x n ,yi,...,y n ip x (xi, ■ ■ ■, x n ) A tp Y (yi ,.. • ,Vn) A ie[n ] Zi + Xi = y { 

Using Theorems 5.2 and 3.9 again, Y\X is ||-rational. D 


Lemma 4.20. Let ip : SP + (A) —>• S be a morphism of sp-algebras. If S is finite, then 
<^ _1 (s) is a regular set of SP + (A), for all s E 5. 


Proof. We build an automaton B s such that L{B S ) = 99 -1 (s). The construction is a 
generalization of the well-know construction from finite semigroups to automata for finite 
words. Consider all the elements of S as the states of B s , with one new state 1. The unique 
initial state is 1, the unique final state is s. Furthermore, in the following, 1 is considered 
as a neutral element for both sequential and parallel product regarding the definition of 
transitions The sequential transitions are as usual: for each state t and letter a, add a 
sequential transition from s to tip(a). Let us deal now with the parallel product. For each 
f E 5, add two new states t\ and p, a new letter t in the alphabet of B s , and a sequential 
transition tf). We name the states of the form t,\ and ta special states , and the new 
letters of the form t special letters. For each f E 5, add a fork transition (t, {1,1, t±}). For 
each t,u,v E S, add a join transition ({v 2 ,t, u},v ■ (t || u)). Let B be the set of special letters. 
Consider the projection p : SP + (A U B) —> SP + (A ) which removes special letters from 
posets. We have p(L(B s )) = We now have to show that p(L(B s )) is rational (note 

that in general, projection does not preserve rationality). This is achieved by replacing, in the 
system of equations in the McNaughton-Yamada-like construction of a rational expression 
from an automaton (see [LW98al Section 4.2]), the equations of the form £, Sl ,.s 2 = s, whose 
solution is L P)Q , by fs 1 ,s 2 = {®} : those replacements do not affect the form of the system, 
whose solution remains rational. □ 


Lemma 


4.20 proves that (pz\(s) is regular, for all s E 5P + (A)/~^, when 5P + (A)/~^ 
is finite. We are now going to prove that ipz} A {X^ j s regular for every equivalence 

class A^ P Q f A)/~a , even w j ien SP + (A)/^_ a is not finite. The idea is to build 

an automaton B sp+(a)/~ a as in the proof of Lemma 4.20 


by showing that a finite subset of 


SP + ( A )/~_4 can be used for the states of B sp+(a)/~ a ■ Actually we prove this by translating 
the problem into an sp-algebra N fc * with more properties than SP + (A)/~_ 4 . Very informally 










LOGIC AND BRANCHING AUTOMATA 


15 


SP + (A) ^4 SP + (A)/~ a 


N fc * 


Figure 4: The morphisms between the sp-algebras. Full arrows represent morphisms of 
sp-algebras, and dashed arrows morphisms of commutative semigroups. 


speaking, denote by {g\,... ,gk} the set of e quival ence classes of ~ A containing a sequential 
poset of SP + (A) (which is finite by Lemma 4.13). For every P E SP + (A) whose maximal 
parallel factorization is P = Pi || • • • || P n , the morphism g : SP + {A) —y N k * enables the 
count of #4, i E [n], such that Pi E g-j , for every j E [k]. Also, every language recognized by 
SP + (A)/~ a is recognized by N fc *. 


Let G = {gi,...,gk} = W~ A {P) : 
Lemma 4.13) generator of (S'P + (A)/~_ 4 , 


P E Seq(SP + (A))}. Then G is a (finite, by 
||). We may suppose, by Proposition 3.8, that A 


is sequentially separated. Thus that the elements of G are indecomposable with respect 
to the parallel product, that is to say, each gi E G can not be written gi = s || s' with 


s,s’ eSP+ (A)/~ A . 

Denote by (N fc *,+) the commutative semigroup whose elements are /c-tuples of non¬ 
negative integers, without (0,..., 0). It is generated by the £;-tuples with all components set 
to 0, except one which is set to 1. For short we denote by 1* the element of the generator of N k * 
with the component set to 1. The parallel product + of (N fc *, +) is the sum componentwise. 
We define a morphism of commutative semigroups ip : (N fc *,+) -A (SP + (A)/~ a , ||) by 
ip(V) = gi for all i E [k]. Note that ip is surjective, and that ip _1 (c/i) = {1*} for all i E [k]. As 
a consequence ip~ 1 (ss' ) is a singleton for all ss' E SP + (A)/^ a . Now we equip (N fc *, +) with a 
sequential product, by setting, for all ni, 712 E N fc *, nin 2 = ip^ 1 (ip(ni)ip(n 2 )) ■ This sequential 
product equips N k * with a structure of semigroup since (n \ 112 ) ns = ip^ 1 {ip{n\ri 2 )s^) = 
ip~ l {ip{ip _1 (sis 2 ))s 3 ) = V' 1 ( s i s 2 'S 3 ) = 1 ^ 1 ( 712 ^ 3 ); considering ip(rii) = Sj for all i E [3]. 
Thus, N fc * equipped with its parallel and sequential products is an sp-algebra. Observe that 
ip{n\ri 2 ) = ip(n\)ip{n 2 )- Now, we define a morphism of sp-algebras g : SP + (A ) —> N fc * by 
g(a) = ip~ 1 ig r ^ A (a) for all a E A. The diagram of Figure [ 4 ] sums up the situation. 


Lemma 4.21. For all s E SP + (A)/~ a , = g l ip 1 (s). 


Proof. We show by induction on P E SP + (A) that if P belongs to one side of the equality 
then it also belongs to the other. Let P = a E A. If P E <A3^(s) (thus s E G), by definition 
g(P) = V ;- 1 (s). If g(P) E ip^ 1 (s) then s = ip^ A (P). Assume now that P has the form 
P = Pi || P 2 for some Pi,P 2 £ SP + (A). If P E y>“^(s) then ip^ A (P) = s = g^ A (P\) || 
ip~ A (P 2 )- Denote by s* = (p^ A (Pi ), i E [2]. By induction hypothesis P,; E g~ 1 ip 1 (si). Thus 
Pi || P 2 E (^~V _ 1 (si)) || (g~ 1 ip~ 1 (s 2 )) Q g-'ip- 1 ^). On the other side, if P E g~ 1 ip~ 1 (s), 
it follows from the induction hypothesis that ipg(Pi) = Si, i E [2]. As ipg(P) = ipg(P\ || 
P 2 ) = ipg(Pi) || ipg(P 2 ) = si || s 2 , we have P E g~ l ip~ l (si || s 2 ), thus si || s 2 = s and thus 
P E <gz}A s )- Finally assume that P has the form P = PiP 2 for some Pi, P 2 E SP + (A). If 
P E <pz\(s) we proceed as in the case P = P\ || P 2 , observing that ip~ 1 {si)ip~ 1 (s 2 ) C ^ _ 1 (s) 
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if S 1 S 2 = s: if rii E ip 1 (sj), i E [2], then by definition n\n 2 = 1 (siS 2 ) = ip 1 ('S). On the 

other side, if P E p^ 1 ip~ 1 (s), it follows from the induction hypothesis that ipp(Pi) = Si, 
i E [2]. We have tpp(P) = 7pp(PiP 2 ) = tp(p(P 1 )p(P 2 )) = iplp(Pi))ip(p(P 2 )) = si« 2 , so 
sis 2 = s. As ip~ A (P) = p^ A (Pi)(p^ A (P 2 ) = sis 2 = s. □ 


Lemma 4.22. p is a surjective morphism of sp-algebras. 

Proof. Let n E N fc *. If n = 1* for some i E [A;] then n = xp~ 1 (gi). As is surjective 
there exists P E 92 “^ (gi), thus P E p~ l (n). Otherwise, n = l * 1 + • • • + l v for some r > 1 
and ii,...i r E [&]. As for all j E [r], 1^ = ip~ 1 (9i j ), there is some Pj E <pZ\(9ij), and 

p(P\ || ■ ■ • || Pr ) = p(Pl) + ' • ‘ + p(P r ) = l* 1 + ‘ ‘ ‘ + l* r = 71. EH 


Lemma 4.23. p verifies the conditions of Lemma f. 16. 

That p recognizes L Piq for all (p. q ) E Q 2 is 


Proof. First, p is surjective by Lemma 
a consequence of Lemma 


4.21 


4.22 

Because p(L p%q ) = U.YS.K, 


_ . Pi , ,1 ( r , s )ex 9{Seq(L r , s )), and as 

p(Seq(SP + (A))) is finite, then p(L Ptq ) is a ||-rational set of N fc *. □ 


It follows from Lemmas 


4.21 


and 


4.15 


that the equivalence relation ' k * can be dehned 

4.22 As a 


over the elements of N fc *. Further mo re, re call that p is surjective by Lemma 
consequence of Lemmas 4.21, 4.23 and 4.16 each of the equivalence class Avl * of 


N fe * • 

is a 


||-rational of N fc *, and thus, by Theorem 3.9 has the form = U i & j D (ao,i + B® f) for some 
finite set Id, a,D,i £ N fc *, Bd,i some finite part of N fc *. For all i E Id set A^* = ao,i + Bfj i . 
We may assume that all the A^ are pairwise disjoint [ES691 Theorem IV], Note that the 
decomposition of Ainto A^f* = U 7 ; e / ;J A r ^” is not unique. This decomposition may 
influence the constructions below, but not the main result (Lemma 4.28). 

The following lemma links the equivalence classes of P A)/~a anc [ _ 


Lemma 4.24. For all ni,n 2 E N k *, n\ ~g fe * 712 iff tp(ni) P ip(n 2 ). 


Proof. Assume first there exist Pi E p 1 (^i) ) P 2 & p 1 ( ti 2 ) with P\ ^ P 2 . As a con¬ 
sequence of Lemma 4.21 we have <p^ A (Pi) = ipp(Pi) for all i E [2], thus ipp(P\) ( a ' > ^~ a 
ipp (TE)■ Assume now, for the inclusion from right to left, that there exist Pi E i pZ 1 A (' l P( n i)) f° r 

all i E [2], with Pi ^ P 2 , but P{ A$ P ^ P 2 for all P[ E /x _1 (m), P 2 E ^ _1 (n 2 ). As a 
consequence of Lemma 4.21 we have, for all i E [2], ipp(P-) = ip{rii), thus P L ,P[ E 

P 2 ~ 0 P+(A) P -2 we have P[ P+(A) P 2 , 

□ 


and thus Pi ^ P\. As P( 
which is a contradiction. 


SP+(A) 


Pi 


SP+(A) 

A 


Lemma 4.25. For all 1.1’ E N fc *, if t ~ 0 k * t! then t tl. 

Proof. We first show ip(tn) = ip(t'n) and ip(nt) = ip(nt'). Indeed, let P,P',N E SP + (A) 
such that p(P) = t, p(P') = tJ and p(N) = n. If t t' then for all (p, q) E Q 2 , P is the 
label of a path from p to q in A iff so is P'. As a consequence, for all (p, q) E Q 2 , PN is 
the label of a path from p to q in A iff so is P'N. Thus 1Z(PN) = IZ(P'N). This implies 
ip^ A (PN) = ^JP'N), thus ip(tn) = ip(t'n). Now, by definition tn = ip~ l {ip{t)ip(n)), thus 
ip(tn) = 7 p(t)ip(n) = gi = ip(t'n) for some gi E G, thus tn = tin = 1*. We show that nt = ntl 
using symmetrical arguments. □ 
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Lemma 4.26. For all n E N , p 1 (n) is a regular set of SP + (A). 


The construction given in the proof of Lemma [4. 26| below is illustrated by Example 4.29 
located at the end of this section. 

Proof. The lemma is achieved by constructing an automaton B n from N fc *. Take m an 
element of N fc * which is greater than n and all od,« + &?;, for all b t E Bo,i, i E Id, D E V{Q 2 )- 
The finite set S of states of B n consists in 

• Si = {x E N fc : x < m}; 

• for each = ao,i + P§j, a new state A £>,i (set S 2 = {A d,% '■ i E Id,D E P(Q 2 )}); 

• for each element x E S± U S 2 , two additional special states xj and X 2 (set S 3 = {xi : i E 
[2], x E S'] U S2}) and a new letter x. 

For uniformity with the construction given in the proof of Lemma 4.20[ we denote by 
1 = (0,..., 0). We have S = S1US2AS3. For all s, s' E Si, A^, E S 2 , s" , s'" E (SiUS^) —{1}, 
define o : (Si U S2) 2 -A Si U S2 such that 1 is a neutral element for o, A o s'" = aD,i 0 s'", 
s" o A r>,i = s" o ajjj, s o s' = s ■ s'. Note that x o y E S2 iff one of x, y belongs to S2 and 
the other is 1. It is just verification to check that o is associative, as a consequence of 
the associativity of ■. The sequential and fork transitions whose source belongs to Si U S 2 
are defined as in the construction in the proof of Lemma 4.20, by replacing the sequential 
product of SP + (A)/~yi by o. For all s, s' € S 1 U S 2 , add a sequential transition (si, s, S 2 ). 
Define also 

<5(s + s') if s, s' € S 1, 

A D ,i if s = A Dii and s' E B D<i , 

Ac,i if s E B D)i and s' = A Dji , 

x undefined otherwise 
Si U S 2 is given by, for all n E N fc , 


s = < 


where <5 : N fc 


S(n) = 


n 


if n < m, 


A D t i if n E Ap i and not n < m. 


The join transitions are defined as follows. For each s,t E (Si U S 2 ) — {1}, U2 E S 3 , add a 
join transition ({u2, s,i},ao(s® t)) if u o (s © t) is defined. The unique initial state of B n is 
1, and its unique final state is n. From now we slightly change our notation for simplicity: 

we denote by v=^x the existence of a poset P' E SP + (AU B) such that p(P') = P (p 

E>n 

is defined as in the proof of Lemma |4.20 ) and of path in B n from v to x labeled by P'. 
We claim that, for all P E SP + (A), v,x E Si U S 2 , iff x = v o 5 p(P). First, the 

implication from left to right. We proceed by induction on P. If P = a E A then necessarily 
p(P) < m. By construction there is a sequential transition labeled by a from v to x iff 

x = vop(a) = vo 6 p,(a). Assume now P = P1P2 for some Pi, P2 E SP + (A). A path 7 : v ==> x 

E>n 

can be decomposed into 7 : v ==> y x, and by induction hypothesis we have y = vodp(Pi), 

thus x = (v o 5 p(Pi)) o <5/i(P 2 ) = wo (8p(Pi) o 5 y(p 2 )) with the help of the associativity of o. 
As a consequence of Lemma 4.25 and by definition of 5 , we have ( 5 (n)) o x = n ■ x for all 
n, x E N fc *, thus 5p(Pi) o 5 p(P 2 ) = //(Pi) • //(Pa) = //(P 1 P 2 ) = <5/x(PiP2) since //(PiP 2 ) < m 
because //(P1P2) has the form V for some i E [k]. Finally assume P = P\ || P2 for some 
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Pi, P2 E SP + (A). If there is a path 7 : v = 7 - x, then by construction and with the help of the 

Pn 

induction hypothesis it has the form 7 = (r,{l, 1 , ^1}) (71 || 72 || 73)({d//(P(), 5 ti(P 2 ), ^2}, x) 
where 7 i : 1 =A Sfi(P-) for all i E [ 2 ], and for some P[, P 2 E ,SP + (A) such that P = P[ || 

Pn 

P 2 , and 73 is the path consisting of the sequential transition (y 1 , 0 , 02 ). By definition 
of the join transitions we have x = v o (5fi(P[) © 5p{P 2 )). Thus d//(P{) © dfi(P 2 ) is 
defined and we have three cases. In the first case 5//(P{), 5fi(P 2 ) E 5j and we have 
<5//(P{) © 5p(P 2 ) = S(iJ,(P{) + //(P 2 )) = 5/t( P ). Up to a symmetry, the second and third 
cases are similar, so assume wlog. we are in the second case: <5//(P[) © 5p(P 2 ) = A d,% 
with B//(P() = A d,i and 5fi(P 2 ) E Bd,i- Necessarily //(P{) = an,i + Bj,i + • • • + for 
some r E N, b t j E Bo., for all j E [r], and p(P 2 ) = 6 i,r+i for some bi^ r+ \ E Bd,z- Thus 
//(P) = aD,i + &i, 1 + • • • + bi r + and 5//(P) = Ad,i- Let us turn now to the implication 
from right to left. The cases P = a and P = P 1 P 2 for some P\ , P 2 E SP + (A) are as 
above, so assume P = P\ || P 2 . Up to a parallel refactorization of P we may assume, if 
<5//(P) = Ao t i for some Ao^, that //(Pi) = ao,/ + + • • • + 6 j, r for some r E N, b t j E Bd,% 

for all j E [r], and //(Pa) = &i, r +i for some 6 j jr+ i E Bd,z- So assume first 5//(P) = A^: 
either //(Pi),//(P 2 ) E Si, or <5//(Pi) = Aq^ and //(P 2 ) E Si. In the first case, for all 

p. 

i E [ 2 ], f-i(Pi) = S/i(Pi ), and by induction hypothesis there is a path 7, : 1 1 o d//(Pj) = 

Pn 

8 fj,(Pi) = //(Pi). By construction there is a fork transition / = (o, { 1 , 1 , Ui}), a sequential 
transition t = (01,0,02) and a join transition j = ({//(Pi), //(P?), U2}, o o (//(Pi) © //(P?)) 
with o o (//(Pi) © //(P2)) = vo 5 (n(Pi) + //(P2)) = o o < 5 //(P), thus /(71 || 72 || t)j forms a 

p 

path 7 : 0=^-0 o 8 fi(P). In the second case, by induction hypothesis there exist a path 

Pn 

P P 

71 : 1 1 o 8 /jl(Pi) = and a path 72 : 1 ==> 1 o < 5 //(P 2 ) = //(P2). By construction 

there is a fork transition / = (o, {1,1, hi}), a sequential transition t = (hi, o, 02) and a 
join transition j = ({A d,u //(P2), 02}, o o (A^j © //(P2))) that can be used to form a path 

p 

7 : o=>o o 5 n(P) because A o,i © //(P2) = A^ = //(P). Finally the case 5 //(P) E Sj is 

Pn 

identical to the case 5 //(P) = Ao^ with //(Pi),//(P2) E Si. □ 

Lemma 4.27. For all D E V(Q 2 ), i E In, // _1 (A^ fc *) is a regular set of SP + (A). 

Proof. The construction is almost the same as in the proof of Lemma | 4 . 26 | We only change 
m to be greater than all ao,i + bi, for all bi E Pd,*, i E Id, D E V(Q 2 ), without considering 
to, and the final states are A07 and all states belonging to a0,1 + PfJ $. □ 


Lemma 4.28. For all D E V(Q 2 ), </?^(A^ P a re g U i ar se / 0 f SP + (A). Similarly, 

for each equivalence class c of ~seq </>“{fc) is a regular set of SP + (A). 


Proof. By Lemmas 


4.23 


and 


4.15 


ip 1 


( A SP +( a )/~ a i _ 


) = A^f ^ = // 1 (A^*). Because 


// X (A^ *) = Ui e / D // 1 (A^,J), with Id finite, and regular sets are closed under finite union, 
it follows from Lemma 


Lemma 


4.17 


4.27 


that 


-1 

A 


(A s d p+{a)/ ~ a 


) is a regular set of SP + (^4). As by 


an equivalence class c of ~f e q is a finite union of equivalence classes of 

□ 


_SP (A) 7 ^ 4 ” ^j^(c) is also a regular set of S'P + (A). 
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We now give an example illustrating the construction given in the proof of Lemma |4.26 

Example 4.29. We consider the rational language L = ((aa) || a)®a of SP + (A ) with 
A = {a}, and the automaton A pictured in Figure [5] which verifies L(A) = L. We have 



Figure 5: An automaton A with L(A) = ((aa) || a)®a. 





u {{( 1 , 7) n , ( 2 , 5) m , (3, 6 ) m }} if (p, q ) = ( 1 , 7); 

n+m>0 

{(Oh*?)}} if (p,q) G {( 2 , 4 ), ( 4 , 5 ), ( 2 , 5 ), 

(3, 6 ), (7, 8 ), (1,8)}; 

0 otherwise. 


As stated in Lemma 4.14, is a ||-rational set of (Q x Q)®, with Q the set of states of 

A, for all (p, q) e Q x Q. For example, Jqq 7 ) = ((1, 7) + ( 2 , 5) || (3, 6 ))®. We now compute 
SP+(A)/~ a . First observe that (k > 0 in the equalities below): 

U [J (((ovyrMwni 

xe77((aa)ll fc ) 


n,mGN 

m>k 


U A,,7)\\*= u {{a, 7 )", (2. sr. ( 3 , 6 )”-*}} 

x£.'R,(a\\ k ) 


n,m£ N 
m>k 


U Ai. 7 )V= U {{( 1 . 7 )”, ( 2 . 5 )"*, ( 3 ,6)™}} 

irG77((a||(aa))ll fe ) 


n,mSN 


with 

lZ(a^ k ) ={x\, ..., Xk} with Xi G {(2,4), (3,6), (4, 5), (7, 8)} for all i G [k] 
K«aa)») ={{(2,5)*}} 

K((a || (aa))<‘) ={{(1,7)*}} 

As 0 € UieK(( n ||(na))ll fr ) AulV't (take n — m — 0) then 1 7 for all k> 0 , On the 

other side, as 0 0 Utre7^(all te ) ( res P- ® £ Uxe^((oo)ll fc ) -^(1,7) V)> the 11 for a11 k > 0 , not 

1 =4> 7 for all k > 0 (resp. not 1 7). Uxe77((aa)ll fe ) ^{l,7)\ x 7 ^ U;Ee7e((a.a.) II fe ') - ?: '(l,7)\\ a:; 

and Uxe7?.(all fc ) V / U.- ce 7 e( 0 .l|fc') ^(1,7)'V for a11 k, k! > 0 with k / A; 7 , then SP + (A)/~ a 
has not hnite index. Actually SP + (A)/~ a is composed of the following equivalence classes 
(recall that <p~ A : SP + (A) —> SP + (A)/^ a is the morphism which associates to each poset 
P G SP + (A) its equivalence class in SP+(A)/~ a ): 

• for all k > 0, one class denoted by a^ k , such that ip^ A (a^ k ) = a^ k ; 

• for all k > 0, one class denoted by (aa)^ k , such that < p^ A ((aa)W k ) = (aa)H fc ; 
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• one class denoted by (aa) || a, such that <p^, A (((aa) || a)H fc ) = (aa) || a for all k > 0 ; 

• one class denoted by ((aa) || a)a, such that <p^ A ((((aa) || a)" k )a) = ((aa) || a)a for all 
k > 0; 

• one class denoted by 0, such that ip^ A (P) = 0 for all P £ SP + (A) which are not mentioned 
above. 


The sp-algebra SP + (A)/~a is equipped with the parallel product || verifying 

a \\( k ~ k ') if /j > y 

(aa)H fc ' = < 


aa)II 
(aa) || a 


if k! > k 
otherwise 


a)\ k || a^ k ' = alK^+^O; 

(aa)H fc || (aa)H fc/ = (aa)l^ fc+fe ^; 


• ((aa) |j a) || a k = a fc ; 

• ((aa) jj a) |j ( aa) k = ( aa) k . 


with (aa) |j a idempotent, and the sequential product • verifying 


• a ■ a = aa; 


• ((aa) || a) • a = ((aa) || a)a. 


such that 0 is a zero for both products, and all products undefined above are mapped to 0 . 
It recognizes L since L = cp~^(((aa) || a)a). We have ip^ A (Seq(S P + (A))) = {a,aa, ((aa) || 
a)a, 0}, whose cardinality is 4. Consider N 4 *, and define the morphism of commutative 
semigroups ^ : (N 4 *,+) —> (SP + (A)/^_a, ||) by 

• ^((1, 0 , 0 , 0 )) = a; • V’((0, 0 , 1 , 0 )) = ((aa) || a)a; 

• ^(( 0 , 1 , 0 , 0 )) = aa; • ^(( 0 , 0 , 0 , 1 )) = 0 . 

Equip N 4 * with its sequential product. We have n(l, 0,0,0) = (0,0,1,0) for all n £ 
{(k,k, 0 , 0 ) : k > 0 }, ( 1 , 0 , 0 , 0 )( 1 , 0 , 0 , 0 ) = ( 0 , 1 , 0 , 0 ) and all other sequential products 
are mapped to (0,0, 0,1). Define also the morphism of sp-algebras fx : SP + (A ) —> N 4 * 
by n{a) = ip- 1 ip(a) for all a £ A. Note that L = // - 1 ((0,0,1,0)) and SP + (A ) — L = 
// - 1 (N 4 * — {(0,0,1,0)}) with N 4 * — {(0,0,1,0)} a ||-rational language of N 4 *, since 

N 4 * - {(0,0,1,0)} =(1, 0, 0, 0) || B® + (0,1, 0, 0) || B® + (0,0,1, 0) || B ® + (0, 0, 0,1) || B® 

where B = {(1,0, 0, 0), (0,1,0, 0), (0,0,1,0), (0,0,0,1)}. 

Set 


. D 1 = {(2,4), (4, 5), (3, 6 ), (7, 8 )}; . D 3 = {(1, 8 )}; 

. D 2 = {(2,5)}; . £) 4 = {(1,7)}. 


We have 


. Ag* = {(1,0,0,0)}; 
. Ag* = {(0,1,0,0)}; 
. Ag* = {(0,0,1,0)}; 


• Ag = {(A;, A:, 0,0) : k > 0 }; 

• Ag fe * = 0 for all D £ V + (Q 2 ) - 
{Di, D 2 , T> 3 , D 4 }; 

A#* _ -RT4* /1 I 


Now, from any element s £ N 4 *, say for example s = (1, 2,1, 0), we construct on automa¬ 
ton A s such that L(A S ) = ^ - 1 (s), following the construction of the proof of Lemma 4.26 
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Note that /r _ 1 ((l,2,1,0)) = {a || (aa) || (aa) || ((a || (aa))H fc a) : k > 0}. The first step of the 
construction consists in writing all the sets *, D 6 V(Q 2 ), as a union of disjoint linear 
sets of N 4 *. This is trivial when D is one of Di, Z) 2 , D 3 , or D 6 V + (Q 2 ) — {D\, D 2 , D 3 , D 4 }. 
For A^*, we have A^* = (1,1,0,0) || (1,1,0,0)®. This could also easily be done for A^*, 
but it can be avoided. Indeed, assume that Aj^ is partitioned into finitely many linear 
sets: A^* = Ujg/gA^* with a finite set. Assume also that A s has one state Ag ,; for 
each * 6 / 0 . Take one of those states A@_j. Following the construction of A s , it can be easily 
checked that if a path uses one of the states Ag % or ( 0 , 0 , 0 , 1 ) then it continues either in 
Ag j or in (0, 0, 0, 1 ): the final state s of A s is unreachable. 

Let us then return to the construction of A s - Choosing m as small as possible using the 
remark above, we have m = ( 2 , 2 , 1 , 0 ), then Si = {(xi, X 2 , x 3 , 0 ) 6 N 4 : xi,x 2 < 2 ,x 3 < 1 }, 
and we can reduce S 2 to S 2 = {Ad 4 }. The initial and final states are respectively 1 and s. 
The (useful) sequential transitions are 


• ( 1 , a, ( 1 , 0 , 0 , 0 )), • ((fe, k, 0 , 0 ), a, ( 0 , 0 , 1 , 0 )) for all 0 < k < 2 , 

• ((1,0,0,0), a, (0,1,0,0)), • (A D 4 ,a, (0,0,1,0)). 


The fork transitions are from all state t to {1,1, £ 1 }. Finally, the join transitions are 

• ({T 2 , (xi,X 2 , x 3 , 0), (x^, x' 2 , x' 3 , 0)}, {x\ + x[,X 2 + x' 2 ,x 3 + x' 3 , 0)) when x\ + x \, x 2 + x 2 < 2 
and x 3 + x 3 < 1, 

• ({I 2 ,(xi,X 2 , 0 , 0 ),(x / 1 ,x , 2 , 0 , 0)}, A^ 4 ) when x\ + x[ = x 2 + x 2 > 2, 

• ({T 2 ,A D 4 ,( 1 , 1 , 0 , 0 )},A D4 ). 

Figure [i] represents a successful path in A s labeled by a || (aa) || (aa) || ((a || (aa))^ 3 a). 



a 


1000 
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a 
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Figure 6 : A path labeled by a || (aa) || (aa) || ((a || (aa))H 3 a) in A s . In order to lighten the 
picture, states of the form (x\, x 2 , x 3 , X 4 ) are denoted xiX 2 X 3 x<i. Also, the special 
states and transitions using them have been removed or simplified. 


5. P-MSO 

In this section we define a logical formalism called P-MSO, which is a mix between Pres- 
burger jPre30| and monadic second-order logic, and that has exactly the same expressivity 
as branching automata. As all the constructions involved in the proof are effective, then the 
P-MSO theory of the class of finite N-free posets is decidable. 
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Let us recall useful elements of monadic second-order logic, and settle some notation. 
For more details about MSO logic we refer e.g. to Thomas’ survey paper jEF991 ITho97j . 
The monadic second-order (MSO) logic is classical in set theory, and was first set up by 
Biichi-Elgot-Trakhtenbrot for words [ BiicGO ;. |Elg6l| f Tr a62j . In our case, the domain of 
interpretation is the class of finite N-free posets. 

Monadic second-order logic is an extension of first-order logic that allows to quantify 
over elements as well as subsets of the domain of the structure. Given a signature £, one 
can define the set of MSO-formulus over £ as well-formed formulae that can use first-order 
variable symbols x, y ,... interpreted as elements of the domain of the structure, monadic 
second-order variable symbols X, Y ,... interpreted as subsets of the domain, symbols from 
£, and a new unary predicate X(x), also denoted x E X for readability, interpreted as “the 
interpretation of x belongs to the interpretation of X ”. We call sentence any formula without 
free variable. As usual, we will often confuse logical symbols with their interpretation. 

Given a signature £ and an £—structure M with domain D, we say that a relation 
R C D m x V(D) n is MSO-definable in M if and only if there exists an MSO-formula over £, 
say ip(x±, ..., x m , Xi, ..., X n ), which is true in M if and only if (aq, ..., x m , X\, ..., X n ) is 
interpreted by an (to + n)-tuple of R. 

Given a finite alphabet A, let us consider the signature Ca = {<, (P a )aeA} where < is a 
binary relation symbol and the R a ’s are unary predicates (over first-order variables). One can 
associate to every poset (P, <, p) labeled over A the £q—structure M(p <)P ) = (P; <; ( R a ) a &A) 
where < is interpreted as the ordering over P, and R a (x ) holds if and only if p(x ) = a. 
In order to take into account the case P = 0, which leads to the structure Mg which has 
an empty domain, we will allow structures to be empty. Given an MSO sentence ip over 
the signature Ca, we define the language as the class of posets (P, <,p) labeled over 
A that satisfy ip, or, using formal notation, such that M^ P< f ^ |= ip. Two formulae ip and 
ip' are (logically) equivalent , denoted by ip = ip', if L.^ = L^>. We will say that a language 
L C SP + (A) is definable in MSO logic (or MSO-definable) if and only if there exists an 
MSO-sentence ip over the signature Ca such that L = L^,. 

In order to enhance readability of formulae we use several notations and abbreviations 
for properties expressible in MSO. The shortcut a(x) is used instead of R a (x). The following 
are usual and self-understanding: (p ip, X C Y, x = y. An existential (resp. universal) 
quantification 3xip(X) (resp. Vxip(X)) is relative to X if 3xip(X) = 3x x G X A ip(X) (resp. 
\/xip(X) = Vx x £ X —> ip{X)). Relative existential (resp. universal) quantification of x 
over X is denoted (resp. M x x). The notion of relative quantification naturally extends 
to second-order variables. 

MSO logic is strictly less expressive than automata. There is no MSO-formula that 
defines the language (a || b )®. On the contrary, MSO-definability implies rationality. 

In order to capture the expressiveness of automata with logic we need to add Presburger 
expressivity to MSO. Presburger logic is the first-order logic over the structure (N, +) 
where + = {(a, b, c) : a + b = c}. A language L C N n is a Presburger set of N n if 
L = {(xi, ..., x n ) : ip(x i,..., x n ) is true } for some Presburger formula <p(xi, ..., x n ). If 
cp(x i,..., x n ) is given then L is called the Presburger set of tp(x i,..., x n ) (or of p for short). 
Presburger logic provides tools to manipulate semi-linear sets of A® with formulae. Indeed, 
let A = {ai,... ,a n j be an alphabet. As a word u of A® can be thought of as a n-tuple 
(|u| ai ,..., \u\a n ) of non-negative integers, where |it| a denotes the number of occurrences of 
letter a in it, then A® is isomorphic to N n . 
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Example 5.1. Let A = {a,b,c} and L = {u G A ® : |rt| a < \u\b < |u| c }. Then L is 
isomorphic to {(n a ,?ib,n c ) £ N 3 ■ n a < nb < n-c}, and thus the Presburger set of 

p(n a , n b , n c ) = (3x n b = n a + x) A (3y n c = n b + y) 

Semi-linear sets and Presburger sets are connected by the following Theorem: 

Theorem 5.2 (Ginsburg and Spanier, Theorem 1.3 of GS66 1. Let A = {ai,... ,a n } be an 
alphabet and L C A*' . Then L is semi-linear if and only if it is the Presburger set of some 
Presburger formula ip(x \,..., x n ). Furthermore, the construction of one description from 
the other is effective. 

The P-MSO logic is a melt of Presburger and MSO logics. From the syntactic point of 
view, P-MSO logic contains MSO logic, and in addition formulae of the form 

Qx(Z, . . . , (lfn(X),X n ), ip(xi, ..., X n )) 

where X is the name of a new second-order variable, Z is the name of a (free) second-order 
variable, ifi(X) (for each i E [n]) a P-MSO formula having no free first-order variables, 
and only quantifications relative to X, and cp(x i,..., x n ) a Presburger formula with n free 
variables x\,... ,x n . As the variable X is for the internal use of Qx , then it is bounded 
by Qx- it is a free variable of all the if t (X), i £ [n], but it is not a free variable of 
f>(Z) = Q x (Z,('ifi(X),xi),...,(il> n (X),x n ),(p(xi,...,x n )). Similarly, xi,...,x n are free 
variables of <p(xi ,..., x n ), but they must not be considered as free in f>(Z). 

As in monadic second-order logic, the class of syntactically correct P-MSO formulas is 
closed under boolean operations, and existential and universal quantification over first and 
second-order variables of a P-MSO formula that are interpreted over elements or sets of 
elements of the domain of the structure. Semantics of P-MSO formulae is defined by extension 
of semantics of Presburger and MSO logics. The notions of a language and definability 
naturally extend from MSO to P-MSO. Let us turn to the semantics of 

Qx(Z, (ifi(X),xi ),..., {ij} n (X),x n ),ip(x i,... ,x n )) 

A block B of a poset ( P , <) is a nonempty subset of P such that, if b,b' E B such that 
b < b ', then for all elements of p £ P, if b < p < b' then p E B. A subset G of P is good 
if, for all p £ P, if p is comparable to an element of G and incomparable to another, then 
p € G. A connected block C of a block A of a poset is a block such that, for any different 
and incomparable c,d £ C there exists d’ E C such that c, d < d' or d' < c,d. 

Before continuing with formal definitions, let us give some intuition on the meaning of 
Qx(Z, (ifi(X), x \),..., ( r f n (X), x n ), <p(xi ,..., x n )). Let Y be an interpretation of a second- 
order variable Z in P, such that Y is a good block of P. That means, Y is the poset 
associated with a sub-term of a term on A describing P, and is the parallel composition of 
m > 1 connected blocks: Y = Y\ || • • ■ || Y m . Take n different colors c±,... ,c n . To each Yj 
we associate a color Cj with the condition that 1) satisfies ’ipj(Yj). Observe that this coloring 
may not be unique, and may not exist. Denote by Xj the number of uses of Cj in the coloring 
of Y. Then P, Y (= Qx{Z , fipi(X), x \),..., (ip n (X), x n ), ■ ■ ■ > x n)) if there exists such a 
coloring with x\,... ,x n satisfying the Presburger condition <p(x i,..., x n ). 

More formally, let P G SP + (A), Q X (Z, (?/q(A), aq),..., (gf n (X), x n ), <p(xi,.. .,x n )) be 
a P-MSO formula, Y C P be an interpretation of Z in P such that Y is a good block of P. 
Then P, Y |= Qx(Z, (ipi(X), x {),..., (jjj n (X),x n ), ip( x i, ■ ■ ■, x n )) if there exist non negative 
integers v±,... ,v n and a partition (lip,..., Y] Vl ,..., Y n q,..., Y njVn ) of Y into connected 
blocks Yi j such that 
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• (vi,, v n ) belongs to the Presburger set of ip{x\, ..., x n ), 

• y E Yij, y' E Yy implies that y and y' are incomparable, for all possible (i,j) and ( i',j') 
with (i,j) ± 

• P, Yij J= ipi(Yij ) for all i E [n] and j E [vi\. 


5.1 


Example 5.3 

formula of Example 

is a MSO formula (thus a P-MSO formula) which is true if and only if the interpretation of 
X has cardinality 1. Then L is the language of the following P-MSO sentence: 


Let L b e the language of Example 5.1 and (p(n a , n b , nc ) be the Presburger 
For all a € A, set if a (X) = Cardi(X) AV x x a(x), where Cardi(A) 


VP (Vp p£ P ) -» Qx(P,('i/Ja(X),na),('ilJb{X),n b ),('ip c (X),nc),ip(na,n b ,nc)) 


Theorem 5.4. Let A be an alphabet, and L C SP + (A). Then L is rational if and only if is 
P-MSO definable. Furthermore the construction from one formalism to the other is effective. 


The remainder of this Section is devoted to the proof of Theorem 5.4 


5.1. Prom automata to P-MSO. The inclusion from left to right of Theorem 5.4 relies 
on the ideas of Biichi on words: the encoding of accepting paths of a branching automaton A 
into a P-MSO formula. More precisely, for words this part of the proof consists in mapping 
each letter of the word to a state of the automaton, consistently with the transitions. In our 
case, each letter of the poset is mapped to a sequential transition of A, and each part of the 
poset of the form P = P\ || • • ■ || P n (n > 1), as great as possible relatively to inclusion and 
such that each Pi is a connected block of P, is mapped to a pair (p, q ) of states; informally 
speaking, p and q are the states that are supposed to respectively begin and finish the 
part of the path labeled by P. The formula guarantees that pairs of states and sequential 
transitions are chosen consistently with the transitions of A , and that, if P = P\ || • • • || P n 


P 

as above and p t ==>■ q % for all i E [n], then there exists a combination of fork transitions that 
connects p to p±,... ,p n , a sequence of join transitions that connects q\, . . ., q n to q, such 

p 

that a path q in A is formed. 

Let us give this construction more formally. 

Given two second-order variables representing sets, the properties X C Y, X C Y, 
X n Y = 0 are clearly definable in MSO. The equality x = y of two elements is clearly 
expressible with a MSO formula, that states for example that {x} C {y} A {y} C {x}. We 
denote “there exists an unique x” by 3!x, “x and y are different and not comparable” by 
x || y, “there exists a non-empty set X ” by ELY, “set X has cardinality j” by Cardj(X), 
where j is any integer, “set X has cardinality > j ” by Card>j(YT), where j is any integer, 
“there exists (resp. for all) x in X ” by 3 x x (resp. V x x), U X contains all the elements” by 
Universe(X). All those properties are definable in MSO. In the further we will use the 
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following shortcuts: 

y < X =Vx x £ X —> y < x 
y || X =(-iy £ X ) A (Vx x G X —> y || x) 

X || Y =X n Y = 0 A VxVy(x GXAyGT)-*x||y 
Pred(X, y) =Vx x£X^-3yy£YAx<y/\ -<3z x < z A z < y 
A VyVx(y G T A x < y A -3 2 x<z<y)—»xGX 
Antichain(X) =\/ x x\/ x x' -i(x < x y V / < x) 

Min(M, X) =M C X A Antichain(M) A V X x3 M m m < x 

The formal definitions of a block, a good block and a connected block can be directly 
translated into MSO formulae as follows: 

ConnectedBlock(C, X) =Block(C, X) A (V C cV C c' (c 7 ^ c A Incomp(c, d)) —> 

3 c c" Comp(c, c") A Comp(c / , c')) 

GoodBlock(i?, X") =Block(i?, X) A (V i? rV R r / V x x(Comp(x, r) A Incomp(x, r')) —> R(x)) 
Comp(x, y) =x < y V y < x 
Incomp(x,y) =(->x < y) A (->y < x) 

Block(i2, X") =i? C X A Card>o(i?) A (V /? rV X xV / V r<xAx<r'-> i?(x)) 


Let ^4 be an alphabet and f be a rational language of given by an automaton 

Vl = (Q,A,E,I,F) and P G SP + (A). Following Biichi’s ideas [BiicfiO] . this section is 

p 

devoted to the construction of a P-MSO sentence 4 such that i =>■ / for some i £ I, f £ F, 

A 

if and only if P \= <^ 4 . 

Informally speaking, the sentence annotates P using second-order variables that encode 
the transitions of A, consistently with the notion of a path. To each transition (p, a, q ) we 
attach a second-order variable X^ paq p and use the following formula 


MarkSeq = Vx A (a(x) —> V Xi vag \(x)) 

a£A (p,a,q)£E ’ ’ 

A (Vx A (X (?wz) (x)^ 

(p,a,q)eE 


V 

a'gyl 

(p / ,a / ,q / )gi? 
(p' ,a' ,q')^t(p,a,q) 




in order to express that each element of P labeled by a is the label of a unique transition 
( p,a,q ) in a path. In order to express that if ( p,a,q ) and ( p',b,q') are two consecutive 
transitions in a path then q = p', we use the following formula 


ConsistentSeq = VxVy(Succ({y}, {x}) A Pred({x}, {y})) 

A. 

(p,a,q)eE 


,q)( x )^ , X {giM (y)) 

( q,b,r)£E 


We now turn to a more technical part of the construction of 4 : expressing that each part 
of P of the form R\ || • • • || R n , n > 1, is the label of path from a state s to a state t that 
uses fork and join transitions consistently to the definition of a path in A. Second-order 
variables X~ t and X+ t are used to express that the part of the path labeled by R starts in s 
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Figure 7: A N-free poset and its good maximal parallel blocks. 

and finishes in t. The sets X~ t and Xf t are composed of particular elements of P which are 
named witnesses of R , and are attached unambiguously to R. 

A good maximal parallel block of X C P is a good block R of X such that R = P\ || P 2 
for some nonempty P\ and P 2 , and maximal relatively to parallel decomposition, ie. there is 
no good block R' of X such that R! = R || P 3 for some nonempty P 3 . The property “R is a 
good maximal parallel block of X” can easily be translated into a formula GMPB(i7, X). 

GMPB(i?, X) = R CIA GoodBlock(i2, X) A (3Ri3R 2 R = Ri || tf 2 )A 

Vi?' (R! CIA GoodBlock(7? / , X) A 3R\3R , 2 R! = R,\ || R 2 ) -A 3T R' = R || T 

Example 5.5. Figure [7] represents a N-free poset P and its good maximal parallel blocks G\, 
G 2 , G 3 and G 4 . The good maximal parallel block G\ can be decomposed into G\ = C\ \\ C 2 
where C\ = {i} and C 2 = G\ — C\ are two connected blocks. The good maximal parallel 
block G 4 has two connected blocks: {/} and {5}. Any N-free poset P can be represented 
by a labeled tree, as in the figure, where internal nodes are labeled by sequential or parallel 
products, and leaves by elements of P, and such that no internal node has the same label as 
one of its sons. Formally, these trees are not terms as we defined them because of the arity 
of internal nodes. Good maximal parallel blocks of P correspond to the sub-trees whose 
root is a node labeled by |j. 

The following Lemma holds. 

Lemma 5.6. Let P be a N-free poset, X C P, and R, R' be two good maximal parallel blocks 
of X. Then either R < R! or R 1 < R or R || R' or R C R' or R! C R. 

Proof. First assume R n R' = 0. Let Ri,R 2 ,R , i,R 2 be such that R = R.\ || and 

R! = R ! 1 || i? 2 - Assume that there exist r £ R, r' G R' such that r and r' are comparable. 
Wlog suppose r < r'. As R is a good block, then x < r' for all It follows easily that 

R < R'. If all the elements of R and R! are incomparable then R || R'. 

Assume now that there exists x € RC\ R!. Assume that one is not included into the 
other. Thus there exist r £ R — R! and r' G R' — R. Assume that x and r are comparable, 

say wlog. r < x. If r and r' are incomparable then r G R' because R' is a good block. So 

necessarily r < r' otherwise R' would not be a good block. If x and r / are incomparable then 
R is not a good block, so x and r ' are comparable and necessarily x < r' otherwise R would 
not be a block. Now, as R and R' are both parallel blocks, there exist a G R and b E R' such 
that a is incomparable to r and x , and b is incomparable to r' and x. As R and R' are good 
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blocks then a < r' and r < b. Because P is N-free then a < b. Thus {r, x, a, b} forms an N, 
which is a contradiction. So r (and r') is uncomparable to x. As a consequence r and r' 
are also incomparable, otherwise R and R' would not be good blocks. As r is incomparable 
to x, we also have that if R = R\ || ■ • • || R n is a decomposition of R and x E Ri for some 
i E [n] then C Rni?'. Also, decomposing R' into R' = R\ || ■ • • || R' m , if Ri, R) $2 R n R 1 , 
then the elements of Ri are incomparable to those of R'-. Thus, consider R U R': it is a 
parallel block. Assume it is not good. There exist x E X — (R U R'), a,b E RU R! such that 
x is comparable to a and incomparable to b. If a, b E R (resp. a, b E R') then R (resp. R') 
is not a good block. So assume a E R — R' and b E R' — R. Necessarily r is comparable 
to all the elements of R and incomparable to those of R'. Thus Rn R' = 0, which is a 
contradiction. As a conclusion, R U R' is a good parallel block, R, R' C R U R!, so R and 
R! are not maximal: the intersection of two good maximal parallel blocks is necessarily 
empty. □ 

The following Lemma shows that the set of minimum (or maximum) elements of a good 
maximal parallel block G characterizes G : 


Lemma 5.7. Two different good maximal parallel blocks of P can not have the same set of 
minimum (resp. maximum) elements. 


Proof. Assume by contradiction that G and G' have the same set M of minimum elements 
(the proof is the same if M is the set of maximum elements). According to Lemma 5.6 
G C G l or G’ C G, say wlog. G C G’. There exists x E G’ — G. Necessarily x 0 M, thus 
there exists mi E M such that m± < x. Observe that x is necessarily greater than all 
the elements of M, otherwise G 1 would not be a good part of P. But this implies that G' 
can not be decomposed into G l = G\ || G' 2 , so G' is not a good maximal parallel block, in 
contradiction with the hypothesis. □ 


Let G be a good maximal parallel block of a N-free poset P. The set of witnesses of G 
is the union of the set of left witnesses of G and the set of right witnesses of G, respectively 
denoted WUl{G) and WUr(G) and defined by: 

WUl(G) :the greatests x £ P such that x < G and there is no y € P — G 
such that y || G and x < y; 

Witn(G) :the smallests iff such that x > G and there is no y e P — G 
such that y || G and x > y. 

or equivalently: 

WUl{G) = {w G rnax{x' e P : x < G} : Succ{w ) = min(G')} 

WUr{G) = {w G minjx' G P : G < x} : Predfw) = rnax(G')} 


Example 5.8. In Example 5.5 


• Wit L (Gi) = 0 and Wit R (G i) = {j}, • Wit L (G 3 ) = {a,b} and Wit R (G 3 ) = {e}, 

• WUl{G 2 ) = 0 and Wit^Gf) = {c, d}, • WUl{G/i) = {e} and Wit R (G/i) = {h}. 


Example 5.9. Figure [ 8 ] represents a N-free poset P and its good maximal parallel blocks 
Gi, G 2 , G 3 and G 4 . We have 
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Figure 8: A N-free poset and its good maximal parallel blocks. 


• Wit L (Gi) = 0 and Wit R (G i) = {j}, • Wit L (G 3 ) = {a,b} and Wit R (G 3 ) = {/}, 

• WUl(G 2 ) = 0 and Wit R (G 2 ) = {c, d, e}, • = {/} and Wit R (Gi ) = 0. 


Observe that / is both a right witness of G 3 and a left witness of G 4 . Observe also that 
j is not a right witness of G\. 

Let us denote W~{G) = Pred(minG), W + (G ) = S'ucc(maxG) and W(G) = W~{G) U 
W+(G). 

Lemma 5.10. Either W~(G) or W + (G) or W(G ) is the set of witnesses of G. 

Proof. Observe that Wit^G) C W~(G) and Wit R {G) C W + (G), thus the witnesses of G 
are a subset of W ( G ). Observe also that if x E W~ (G) (resp. W + (G)) is a witness of G , then 
all the elements of W~(G) (resp. W + (G)) are witnesses of G. Indeed, let G = G\ || ■ ■ ■ || G n 
with n > 1 and x G W~{G) such that x is a witness of G. Then x is a predecessor of a 
minimum of G. As G is a good block, then x is also less than all the minimums of G, and x 
is necessarily a predecessor of all those minimums. As in a N-free poset all the predecessors 
of an element have the same successors it follows that if x is a witness of G then W~(G ) 
contains only witnesses of G. We argue similarly with the elements of W + (G). Now let 
w E W(G). Wlog., assume that w E W~{G). Assume that w is not a witness of G: there 
exists r such that r || G and w < r. Let R = GB(r,G ) be the smallest block of elements 
r' E P such that 

• re GB(r, G ) and 

• if r' is comparable to some r" E GB(r , G ) and r' || G then r' E GB(r, G) 

Observe that GB(r 1 G ) is a good block for all r such that r || G. Indeed, assume that it is 
not: there exist r'. r" E GB(r, G), g E G and t E P — GB(r, G) such that t is comparable to 
g and r' and incomparable to r", and it is just verification to check that P is not N-free. 

Assume there exists r' E R such that w is incomparable with r'. If r' < r then w,g,r',r 
form an N for any g E G. If r' and r are incomparable then there exists r" E R such that 
either r, r' < r" or r" < r, r ', otherwise R would not be as small as possible. If r, r' < r" 
then w, g, r", r' is an N for any g E G. The case r" < r, r' is similar. Thus w < R. 

As G U R is not a good parallel block, but G and R are, there exists z e P — (G U R) 
such that 2 : is comparable to all the elements of G and incomparable to all the elements of 
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R, or z is incomparable to all the elements of G and comparable to all the elements of R. 
As this latter case would imply that z G R, then only the first case is possible. If z < G 
and z || R then z and w are different, and z || w is impossible, otherwise there would be an 
N formed by w, z, r and any element of G. It is impossible that z < w because it implies 
z < w < r. It is also impossible that w < z < G. Now take z as small as possible such 
that G < z and z || R. By contradiction, assume that z is not a witness of G; there exists 
p G P — (R U G) such that p < z and G || p. Let Q = GB(p , G). Then Q is a good block, 
Q || G, Q || R and Q < z. As Q U G can not be a good parallel block (otherwise G would 
not be a maximal good parallel block), then exists t G P — (Q U G) such that either 

• t is comparable to an element of Q and incomparable to an element of G, and so to all 
the element of G because G is a good block; this implies t G Q which is in contradiction; 

• or t is incomparable to an element of Q (and thus to all the elements of Q) and comparable 
to an element of G (and thus to all the elements of G). They are two cases. If t < G then 
t is necessarily incomparable and different to w, thus for all g G G, t,p,w,g form an N, 
which is a contradiction. Otherwise, if G < t, then necessarily t < z and t || R, and thus z 
is not as small as possible such that G < z and z || R, which is also a contradiction. 

Thus z is a witness of G , and z G W + (G). 

We use similar arguments to show that if x G W + (G) is not a witness of G then W~{G) 
is the set of witnesses of G. □ 

Corollary 5 . 11 . Every good maximal parallel block G of P which is not P itself has a 
witness. 


Proof. If W (G) is empty then there is no p G P — G which is comparable to an element 
of G. Thus (P — G) 7^ 0 is a good block, and (P-G)llGa good parallel block, which 
contradict the maximality of G. If W~{G) is not empty but has no witnesses of G, then 
using the same arguments as in proof of Lemma | 5.10 the witnesses of G are the elements of 
W + (G) which is not empty. Similarly, if W + (G) is not empty but has no witnesses of G 
then the witnesses of G are the elements of W~{G) which is not empty. d 


Lemma 5 . 12 . Let P be a N-free poset, and a,b,c,d G P such that a is a predecessor ofb, c 
a predecessor of d, a < d and c < b. Then a is a predecessor of d and c a predecessor ofb. 

Proof. Necessarily a || c and b || d. By contradiction assume that the statement of the 
Lemma is false, for example that a is not a predecessor of d. Then, there exists x successor 
of a such that x < d. Then x is incomparable with b and to c, thus x, a, b , c is an N. U 


Lemma 5 . 13 . Every x G P is a left (resp. right) witness of at most one good maximal 
parallel block of P. 


Proof. Assume that x is a left witness of two different good maximal parallel blocks G\ and 
Gd Then x G W~ {G\)C\W~ {G2), and thus there exists g\ G min G\ , g2 G minG^ such that 
a; is a predecessor of both g\ and <72 • A consequence of the definition of left witness is that 
gi G G2 (and thus g\ G minG^) and 52 G G\ (and thus g2 G min G\). Now, let y G W~{G\). 
There exists g[ G minGi such that y is a predecessor of g[. According to Lemma 5.10 
left witness of G\ , and thus y < g 1 and x < g[. As a consequence of Lemma 5 
both predecessors of gi,g[,g2- It follows that W~{G\) = W~{G2), and as a consequence 
minGi = ruin GL■ Assume that G\ / G2, and wlog. that there exists <72 £ G2 such that 
92 0 G\. Then 52 is comparable to a minimum m of G2, which is also a minimum of G \, 
and, because G2 can be decomposed into G2 = G2 1 || G2 2, 92 is incomparable to another 


y is a 


x, y are 
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minimum m! of G 2 , which is also a minimum of G\. Thus, because G\ is good, we should 
have g 2 £E G\, which is a contradiction. M 

Being a left witness of a good maximal parallel block G can easily be encoded into a 
MSO-formula: 


Wit£,(.x, G) = 3M3R Min(M, G) A Pred (R, M) A R(x) 

A similar formula Wit#(x, G) can be written for right witnesses. 

We now come back to the definition of a P-MSO formula that expresses that the (strict) 
part of P identified by X is the label of a path in A. To each good maximal parallel block 
G of X we attach a unique couple of states (p, q) £ Q x Q using the witnesses of G, with 
the help of a two second-order variables, X~ q for left witnesses and X+ q for right witnesses: 


MarkPar = VXVG Universe(X) A GMPB(G, A) —» 

X ~ q (x)) A (WitR(x,G) 

((Wit L (x,G) A A“ g (x)) - 


V (V x x(Wit l(x,G) 

(p,q)eQxQ 

J X „ 


A (V x A 

(p,q)eQxQ' 


V 

(p 1 ,q')£QxQ 


X+ q (x))) 


A (\/ x x A ((Wit fi (x, G) A X+Ax)) 
(p,g)eQx<? 


, V x+ ,(*))) 

{p',q')e.QxQ p ,q 
(p' ,q')^(p,q) 


and we check that 

• for every block of P of the form Ga with G a good maximal parallel block of P and a E A, 
if p =A- g and s ==> r then q = s (and symmetrically for blocks of P of the form aG ); 

• for every block of P of the form GG' with G , G' good maximal parallel blocks of P, if 

p ==> a and s ==4> r then q = s. 

A A 

The check is done with the formula ConsistentPari below. For convenience, we start 
by defining a formula GMPBMinStarts g (M, G, X) for every q E Q such that P,X,M,G \= 
GMPBMinStarts g (M, G, X) if and only if G is the good maximal parallel block of X whose 
set of minimum elements is M (the uniqueness of G is guaranteed by Lemma 5.7) and the 
part of the path labeled by G starts with q. 


GMPBMinStarts g (M, G, X) = GMPB(G, X) A Min(M, G) 

Vx((Wit L (x,G) ->• V X~ (x)) A (Witjt(x,G)) -> V X+(x)) 

r£Q reQ 

Formulae GMPBMinEnds q (M,G,X), GMPBMaxStarts q (M,G,X) and GMPBMaxEnds g (M, G, X) 
could similarly be written in order to define the good maximal parallel block G according to 
its set of minimal/maximal elements, and to express that the part of the path labeled by G 
starts/ends with q. 



LOGIC AND BRANCHING AUTOMATA 


31 


Returning to the definition of ConsistentParr: 

ConsistentPari = VXVGVM _ VM + \/y _ \/y + 
Universe(X)AGMPB(G,X)AMin(M-,G)AMax(M+,G)APred(y-,M-)ASucc(y + ,M+) -A 
Vx A ((Wit L (x,G) AX~ (x)) V (Vit R (x,G) Al + „(x))) -A 

(p,q)£QxQ ^ ^ 

((Card!(y + ) -A Vy Y + (y) -A V X q a r (y))A 

aeA 

r£Q 

(' q,a,r)£E 

(Card>i(y + ) -A VG' GMPBMinStarts g (y + , G', X))) 
A 

((Cardi(y _ ) -A Vy Y~(y ) -A V X rap (y)) A 

aG A 
rGQ 

(r,a,p)£E 

(Card>i(y _ ) aVG' GMPBMaxEnds p (y _ , G', X))) 


Now we define Vv,<?(^0 that tests if the connected block X begins in p and ends in q. 


i(>p, q (X) = \/ x M~\/ x M + Min(Af“,X) AMax(M+,X) -A 

((Cardi(M“) -A Vm M~(m ) -A V X p ar (m))A 

aG A. 5 ’ 

rGQ 

(Card>i(M _ ) aVG' GMPBMinStarts p (Af~, G', X))) 

A 


((Cardi(Af + ) -A Vm M + (m) -A V X r ^ a „(m))A 

aG A ’ ’ 

reQ 

(Card>i(Af+) -A VG' GMPBMaxEnds 9 (Af + , G', X))) 

Observe that in ij> P} q(X) all quantifications can be assumed relative to X. Actually, the 
formula GMPBMinStarts p (AfG', X) ensures that G 1 CX, and, as X is a connected block 
and G'cX, the witnesses of G' are necessarily in X. 

Let Q x Q = {(pi, gi),..., (p n , q n )}- For every (p, q) e Q x Q we define a P-MSO 
formula Qy(X, (V> Pl , gi (y), xi {ip Pn ,q n (Y), x n ), <p p , q (xi,x n )) such that, for any good 
maximal parallel block G of P, 


Pi G I— Qy(G, (lppi,qi (I )j x l)i • • • ) (V'pmgn 00, X n ), ^Pp,q{x 1, • • • , X n )) 


if and only if G can be decomposed into G = G 1.1 || • • • || G\ tXl || • • • || G n p || • • • || G U)Xn 
G ■ ■ 

such that pi =4- q t for every i € [ra] and j € [x;], and {(pi, gi) Xl ,. ■ •, (p n , q n ) Xn } £ In 
other words, P, G \= Qy(G, (^ puqi (Y),x i),..., (ip Pn , qn (Y), x n ), ip p , q (xi, .. .,!„)) if and only 


G 


if there is a path p ==> ( 7 . 


A 


As a consequence of Lemma 4.14 and Theorems 3.9 and 5.2, P P) q is the Presburger 
set (over the alphabet Q x Q) of some Presburger formula <Pp,q{ x 1 , • • •, x n ), from which we 
deduce directly Qy(X, (VWi ( y )> ..., (VWnOO> x„), <p p , 9 (xi,... ,x n )). 

We now write a formula ConsistentPar 2 that applies the adequate 


Qy(G , (0>1,<?1 (F"), Xl), . . . , {lt’p n! q n (y), X n ), lf P: q(x 1, • • • , X n )) 
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to all good maximal parallel blocks of P (except P itself if it is a good maximal parallel 
block). 


ConsistentPar2 = VXVG(X C P A GMPB(G,X)) — > 

Vx A (((Wit L (x,G) AX~ (x)) V (Witx(x,G) Al + Jx))) -A 
(p,9)e QxQ 

Qy{G, (^pi,gi (^0) ^l)> • ■ ■ > {' t Pp n ,qnO^)j x n)i Pp,q( x 1) • • • 1 x n))) 

We are ready to define a P-MSO sentence 0 j\ that defines L. Set ConsistentPar = 
ConsistentPari A ConsistentPar2- Observe that for any nonempty N-free poset, either 
P = P\ || P2 for some nonempty P\,P 2 , in which case P is a maximal good parallel block of 
itself, or P is a connected block. Assuming A = {a, &}, define 


4>A — 3-X’(pi )ai9 i)3X(pi i 6 i9 i)3X pii9l 3X^ )?1 .. • 3-X’(p Tl , 0 , g „)3X(p n> j, ign )3X pn)?n 3X^ i!3Ti 

MarkSeq A ConsistentSeq A MarkPar A ConsistentPar A (VX Universe(X) —> 

((Card > 0 (X) -A ((GMPB(X,X) -A 

.V Qy(X, ( il> Puqi (Y ), Xl ),..., (il’ Pn # n (Y),x n ),<pij(x 1,.. .,x n )) 

feF 

A ((—iGMPB(X, X)) -A V MX)))) 

i£l 

feF 

A (Cardo(X) —>• V true)) ) 

ielnF 


Then P |= cj>A if and only if there is a path i 



f for some i E /, / G F. 


5 . 2 . From P-MSO to automata. Let A be an alphabet and i/j(x 1,..., x n , Xi,..., X m ) be 
a P-MSO formula which has a set V\ = {xi, ..., x n } of free first-order variables interpreted 
over elements of posets (we do not consider here the variables that are interpreted over 
non-negative integers) and a set V2 = {Xi,..., X m } of free second-order variables. A (Vi, V2)- 
poset labeled by A is a N-free poset ( P , <,p) labeled by A xV(V\) x VCV2) such that for all 
i G [n] there exists exactly one p G P such that Xi G 112{p{p)), where ,..., c r )) = (k G 

[r]) is the projection of a tuple on its A; th component. Observe that a poset labeled by A can 
be viewed as a ( 0 , 0 )-poset labeled by A. Observe also that an interpretation of the variables 
xi ,..., x n , Xi,..., X m in P induces a unique (Vi, V2)-poset P(x 1,..., x n , Xi,..., X m ), and 
reciprocally. This allows us to use indifferently one representation or the other in order to 
lighten the notation. The (Vi, V2)-posets are a generalization from words to N-free posets of 
an idea of |PP86] . 

This section is devoted to the construction of an automaton on the alphabet 
A x V(Vi) x V(V 2 ) such that P,x 1,... ,x n ,Xi,... ,X m |= ip(xi,... ,x n ,Xi,... ,X m ) if and 
only if P(x 1,..., x n , X \,..., X m ) G L(A ^) for any P G SP + (A). If ip is a sentence, then 
P |= ip if and only if P G L(A^). The construction of A^ is by induction on the structure 
of ip, and is a generalization of the well-known construction of a Kleene automaton from 
a MSO-formula when MSO is interpreted over words (see for example |Str 94 | for a clear 
presentation of this case). 

It is easy to build an automaton -A(vi, v^l-poset that tests if a poset P G SP + (A) labeled 
by A x V(Vi) x P(V2) is a (Vi, V2)-poset for some Vi, V2. It suffices to test in P, for each 
v G Vi, if v appears exactly once into the sets that appear as second components of the 
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letters. Example | 3 . 2 | exhibits an automaton that tests if a particular letter a appears at least 
once: it can easily be transformed in order to test if a appears exactly once, from which we 
deduce -A(Ui, lAj-poset - 

As a consequence of Proposition | 3 . 6 | and Theorem 
of branching automata are (Vi, V^-posets. 

The construction of an automaton A Xi<Xj that tests if Xj < Xj for some first-order 
variables in the input (Vl, V^-poset has V(V \) U {_L} as set of states with _L a sink state. 
The state 0 is the only initial state and all states V £ V{V\) such that x t , Xj £ V are final. 
The sequential transition from state V £ V(Vi) labeled by (a, W±, W2) goes to 

• X if Xi 0 V and Xj £ W\, or Xj, xj £ W \; 

• V U W\ otherwise. 

Each state s which is not T is the source of a fork transition (s, {s, s}), and for every pair of 
states {W \, W2) £ 'P(Vi) x V(V \) there is a join transition ({Wi, W2}, W\ U W2). 

As the test automata for the atomic formulae a(xj) and Xj(xj) use the same principle we 
only give the construction of an automaton Ax i ( Xj ) that tests the latter. It has Q = {A,T} 
as set of states with A as unique initial state and T as unique final state. The only sequential 
transitions from 1 to T are labeled by (a, W\,W2) such that Xj £ W\ and Xi £ W2. All 
other sequential transitions are from s to s for all states s. The fork transitions are ( s , {s, s}) 
for all states, and the join transitions are ({si, S2}, s 3) where S3 = T if at least one of s 1, S2 
is T, S3 = X otherwise. 

Constructions of automata for the boolean connectors V, A and -1 are a consequence of 
Proposition | 3 . 6 | and Theorem | 4 . 1 | 

Assume now that ip(x \,..., x n , X \,..., X m ) is a P-MSO formula with free first-order 
variables V\ = {x \,..., x n } and free second-order variables V2 = {X \,..., X m }. Assume that 
by induction hypothesis an automaton A^ = (Q, A x V(V\) x V(V2), E, /, F ) can effectively 
be constructed from ij)(x 1,..., x n , X±,... , X m ), and let i £ [n]. We use A^, in order to build 
an automaton A^ Xi ij) = (Q x B, A x V(V\ — {xj}) X V(V2),E', I x {false}, F x {true}) such 
that P,xi,... ,Xj_i,Xj + i, ...,x n ,XX m |= 3 xii/j(x 1,.. .,x n ,X 1 ,..., X m ) if and only 
if P(xi,... ,Xj_i,Xj + i, ...,x n ,X 1} ..., X m ) £ L(ABxii>), for an y P G SP + (A). There is a 
sequential transition (( q , b), (a, IPj, W2), (q\ b )) £ E' if and only if (q. (a, ITj, W2), q') £ E and 
Xj ^ Wi, and a sequential transition ((q, false), (a, W\ — {xj}, W2), (q', true)) £ E' is and only 
if (q, (a, W\. W2), q') £ E and Xj £ Wi. There is a fork transition ((gi, 6), {(q2, b), (53, 6)}) £ 
E' if and only if (gi, {q 2 , 93}) £ E, and a join transition ({(gi, 61), (g2, b 2 )}, (g3, b± or b 2 )) £ E' 
if and only if ({gi, g 2 }, gs) G E. 

The construction for quantification over a second-order variable is similar to the one of 
first-order variable. 

Remark 5 . 14 . We have proved by all the constructions above that for any MSO-sentence 
i/j there exists an automaton A^ such that P \= ip if and only if P £ L(A^). 

We finally turn to the last case where ip has the form 

Qx(Z, (lpl(X),Xl), . . . , (lpn(X),X n ), ip(x 1, . . . , X„)) 

Recall here that xi,... ,x n are variables that are interpreted over non-negative integers, 
and that each ipi, i £ [n], has one free variable X, which is second-order, all quantifications 
relative to X and no free first-order variables. By induction hypothesis, there is an automaton 
A t p i such that P,R |= ipi(R) if and only if P, R £ L(A^ i ). According to the semantics of 
Qx(Z, (ipi(X),xi ),..., ( ip n (X ), x n ), ip(x i,..., x n )), the only interpretations of R in P verify 


4.1 from now we consider that inputs 
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( 1 ) R = P and ( 2 ) P is a connected block. The conjunction of ( 1 ) and ( 2 ) is a MSO-definable 
property of R, and thus according to Remark [ 5 . 14 | above it can be checked by an automaton 
B. As a consequence of Proposition 


3.6 


and Theorem 


4.1 


there exists an automaton A[ 


i>i 


such that Li = L(AL.) = L(A^ i ) n L(B). Now, let B = {b \,..., b n } be a new alphabet 
disjoint from A. As a consequence of Theorems | 5.2 


3 . 9 | and | 3 . 3 | there is an automaton C 
, x n ) over B. Then 


over the alphabet B such that L(C) is the Presburger set of <p(x i,... ,a 
L = L\ o bl (... (L n o bn L(C ))) thus L is regular according to Theorem 3.3 


Example 5 . 15 . Let A = {a, b} and L C SP + (A) be the language composed of the nonempty 

posets of the form P = P\ . P n , where each Pi, i E [n], has the form Pi = Pj i |j • • • || Pi, ni 

with each Pij a totally ordered nonempty poset (that is to say, a nonempty word), and such 
that for each i 6 [n] the number of Pjj that starts with an a is | n t . Set L\ = aA* = aUaA + 

and L2 = bA*. Then L is the language of the rational expression {{L\ || L\ || L 2 )®) + . We 

define L by a P-MSO sentence as follows. Given two elements of the poset denoted by first 
order variables x and y, one can easily write a MSO formula Sue c{x,y) (resp. Pred(x,y)) 
that is true if and only if x is a successor (resp. predecessor) of y. Set 

Lin(X) =y x x\/ x y\/ x z (Su cc(y,x) A Succ(^,x) -» y = z) A 

(Pred(y, x) A Pred(z, x) —» y = z) 
ipi(X) =Lin(A) A a(x) A V x y x = j/V x < y 
(A") =Lin(A) A 3 x x b(x) A \/ x y x = y V x < y 
ip(n a ,n b ) =n a = 2 n b 


Then L is the language of the following P-MSO sentence 


= VP(Vp p € P) -1 3Ai3A 2 P = X 1 © A 2 

AVt/((MaxBlock([7, X x ) VMaxBlock([/, X 2 )) -> Qy(U, (V'l (Y),n a ), (ip 2 (X),n b ),<p(n a ,n b )) 

with X = U © V = Partition(17, V, X) A (VuVu w£[/ArGbA || v). In the formula 
above, Partition(17, V, X) and u || v respectively express with MSO formulae that (U,V) 
partitions X, and that u and v are different and not comparable. The MSO formula 
MaxBlock([7, X) express that U is a block of X, maximal relatively to inclusion. 


6. Another definition for branching automata 

In this section we introduce another notion of automata for languages of SP + (A), which is 
actually equivalent, in expressiveness, to branching automata. 

Denote by Pres(n) the class of Presburger formulae <p(x \,..., x n ) with n free variables. 
A Presburger-branching automaton (or P-branching automaton for short) is a tuple A = 
(Q, A, E, I, F) where Q = {q\, ..., q n } is a finite set of states, A is an alphabet, / C Q is the 
set of initial states, F C Q the set of final states, and E = ( E seq , E{ 0 r k, Ej 0 j n ) is the set of 
transitions of A, which are of three kinds: 

• E seq C (Q x A x Q) contains the sequential transitions, which are usual transitions of 
Kleene automata; 

• -Eforit an d Ej 0 i n , respectively the fork and join transitions, are finite subsets of Q x Pres(n). 
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Let F be a multi-set of elements of Q. By p(F) we denote the Parikh’s image of F, ie. 
the element (rq,..., v n ) of N n such that F is the multi-set with iq occurrences of q % , for all 
i G [n]. Paths in P-branching automata are defined as in branching automata, except for 
the parallel case: for any finite set of paths {70,... ,7*,} (with k > 1) respectively labeled 
by P 0 ,..P k G SP + (A), from r 0 ,...,r k to s 0 ,..., s k , if t = (p, ip(xi,x n )) is a fork 
transition, t! = (q, ..., x' n )) a join transition, then 7 = t(\\j< k 7 j)t' is a path from p to 

q and labeled by || j< k Pj if p({r 0 , • • •, r k }) = (v 1 ,...,v n ) and p([s 0 , ..., s k }) = (v[,..., v' n ) 
are respectively in the Presburger set of tp and of p', and ^ie[n] v ii Sie[n] v i > 1 - 

The notions of accepting paths and languages of a P-branching automaton are defined as 
in branching automata. A language L C SP + (A ) is P-regular if there exists a P-branching 
automaton A such that L = L[A). 

Theorem 6 . 1 . Let A be an alphabet and L C SP + {A). Then L is regular if and only if it 
is P-regular. 


Proof. It is immediate that regularity implies P-regularity, since a fork transition (the same 
applies for join transitions) (p, {ro,... r k }) of a branching automaton can be interpreted 
as a fork transition (p, <p(xi,..., x n )) of a P-branching automaton, with p({ro ,... r k }) the 
unique element of the Presburger set of (p(x \,..., x n ). 

The converse needs more attention since the Presburger set of a formula <p may be 
infinite. Assume that A is a P-branching automaton with Q = {q \,..., q n } as set of states. 
We replace one by one all of its fork transitions (p, <p(xi ,..., x n )) by fork transitions of 
branching automata, by the following construction that add new states to A. Denote 
by Lp = {F G M >l (Q) : p(F) is in the Presburger set of <p}. Each element of L v can 
equivalently be represented as an element of Q®. We first build a branching automaton A v 
on the alphabet Q such that L(A ip ) = L^. It has exactly one initial state, which is not the 
destination of any transition, and one final state which is not the source of any transition. 
” L, 


By Theorem 5.2 


is a semi-linear set of Q'*’: 

= U Qji, 1 II ''' II Qji,i t II ( U 

i£l k&Ki 


% 


k, 1 


% 


k,i k I 


for some finite set /, disjoint finite sets Ki, i € I, with ji, r ,jk,s £ N for all i G I, k G Ki, 
r G [h], s G [l k ]. Wlog. we may assume that each multi-set of L v has at least two elements, 
so li > 1 for all i G /. Then A v is composed of one initial state 1, one final state /, two 
states qj~ and qj~ and one sequential transition (qj~, qj i r , qj~) for all i G /, r G [h], two 
states qj k r and qfff and one sequential transition (qj k r ,qj k r , qj k r ) for all i G I, k G ATj, 
r G [l k ]- For each i G I, there is one fork transition from 1 to all the qj~, r G [li], and 
symmetrically, one join transition from all the qj~, r G [li], to /. For each i G I, add also 
two states 17 and I7, a fork transition (17, {17, uf}), a join transition {{vd, uf}, uf), and, for 
all k G I\i, a fork transition ('ll,;, {qpff, ■ ■ ■, qj k !fc }) and a join transition ({qj k x , ■.., qj k } , uf) 
if l k > 1 , a sequential transition (ul, qj k 1 ,uf) if l k = 1 . For each i G I, add a fork transition 
I 1 ; and a join transition ({qj~f,... ,qJ~f~, 7 iF}, f). We have L(A lf ) = L^. 

Now, remove all sequential transitions, all join transitions, / and all states of the form q from 
A,_p and name A' v the new automaton. Consider the disjoint union of A and A!^. Remove 
(p, <p(x 1,..., x n )). Replace 1 by p and all states of the form q, q G Q, by q. Join transitions 
(p, p(x \,..., x n )) are removed with a similar mechanism. □ 
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7. Conclusion 


The effectiveness of the constructions involved in the proof of Theorem 5.4 have several 
consequences. The P-MSO theory S of SP + (A) consists of all sentences 4> of P-MSO such 
that P |= 4> for every P € SP + (A). The P-MSO theory of SP + (A) is decidable if there 
exists a decision procedure that tests if cj) 6 S. Because emptiness is decidable for languages 
of branching automata (see Proposition |3.7[ ), we have: 

Theorem 7.1. Let A be an alphabet. The P-MSO theory of SP + (A) is decidable. 


In |LW00j , Lodaya and Weil asked for logical characterizations of several classes of 
rational languages. As it is equivalent to branching automata, P-MSO is the natural logic 
to investigate such questions, that are still open. 

Extending the work of Lodaya and Weil, and those of Kleene-Schiitzenberger, Kuske and 
Meinecke [KM04| proposed to attach costs to paths in branching automata. They defined 
and studied branching automata with costs, and extended to this case the machinery from 
the theory of weighted automata. They provided in particular a Kleene-like theorem for 
branching automata with costs, in the particular case of bounded-width lan guages. 

Among the works connected to ours, let us mention Esik and Nemeth EN02) . which 
itself has been influenced by the work of Hoogeboom and ten Pas jHtP961 !HtP97j on text 
languages. They study languages of bi-posets from an algebraic, automata and regular 
expressions based point of view, and the connections with MSO. A bi-poset is a set equipped 
with two partial orderings; thus, N-free posets are a generalization of N-free bi-posets, where 
commutation is allowed in the parallel composition. 

MSO and Presburger logic were also mixed in other works, but for languages of trees 
instead of N-free posets. Motivated by reasoning about XML documents, Dal Zilio and 
Lugiez DZ 1.031 . and independently Seidl, Schwentick and Muscholl jSSMOSj . defined a 
notion of tree automata which combines regularity and Presburger arithmetic. In particular 
in [SSM08j . MSO is enriched with Presburger conditions on the children of nodes in order 
to select XML documents, and proved equivalent to unranked tree automata. Observe that 
unranked trees are a particular case of N-free posets. 
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